Tech Stack
AWS, Cloud, Distributed Systems, Kubernetes, Microservices, Terraform
About the role
- Lead and manage a team of 6–8 platform engineers building scalable infrastructure, automation tools, internal platforms, and testing frameworks.
- Own end-to-end delivery of platform initiatives including cloud infrastructure, Kubernetes orchestration, CI/CD pipeline enhancements, IaC automation, and platform testing strategies.
- Establish and enforce best practices for infrastructure management, deployment processes, automated testing, monitoring, and incident response.
- Collaborate closely with product engineering teams, security, and IT to align platform capabilities and quality standards with business needs.
- Drive improvements in system reliability, performance, testing coverage, and developer productivity through automation and tooling.
- Identify bottlenecks in platform operations, development workflows, and testing processes; champion process improvements and technology upgrades.
- Foster a culture of knowledge sharing, continuous learning, high-quality engineering, and innovation within the platform team.
Requirements
- 10+ years of software engineering experience with exposure to platform engineering, infrastructure, DevOps, and testing/QA.
- 4–5 years of people management experience, directly managing teams.
- Hands-on experience with automated testing tools for functional and non-functional requirements (NFR) testing.
- Proven experience leading and scaling engineering teams focused on platform or infrastructure projects.
- Strong expertise with cloud platforms (preferably AWS), Kubernetes, and infrastructure-as-code tools (Terraform, Pulumi).
- Deep understanding of CI/CD pipelines, automation frameworks, deployment strategies, and quality assurance best practices.
- Experience with monitoring, alerting, incident management, and testing of distributed systems.
- Excellent communication and leadership skills.
- (Good to have) Experience in fintech/payments or regulated industries; familiarity with the DORA framework; security and compliance knowledge; experience with microservices and service meshes; test automation tools; Agile methodologies; cost optimization and cloud governance.