Tech Stack
AnsibleAWSCloudFluxGoJavaKubernetesPythonTerraform
About the role
- Own & Evolve Platforms: Design, build, and scale core platform components (EKS/ECS, GitHub-native CI/CD, OTEL observability, cost monitoring).
- Modern IaC & App Delivery: Architect and enforce Terraform standards (module catalogs, policy-as-code with OPA/Conftest), manage Kubernetes apps via Helm (charts, repos), and implement GitOps (Argo CD/Flux) for progressive delivery and compliance.
- Accelerate Developer Experience: Deliver self-service infrastructure, opinionated deployment patterns, and automation that enable teams to ship faster with confidence.
- Advance Reliability & Resilience: Lead implementation of autoscaling, canary releases, anomaly detection, rollback automation, and disaster recovery patterns.
- Cross-Functional Influence: Collaborate with Engineering, Product, Finance, and Security to align platform investments with organizational goals.
- Mentorship & Leadership: Guide engineers in platform best practices, review designs, and raise the technical bar across the org.
Requirements
- 5+ years of experience in platform, DevOps, or SRE roles.
- Strong proficiency in one or more programming languages (Python, Go, or Java).
- Expert-level experience with AWS (multi-account, ECS/EKS, IAM, cloud networking).
- Deep knowledge of containerization, orchestration, and Infrastructure as Code (Terraform, Ansible, CDK, or CloudFormation).
- Track record of designing and delivering production-grade, scalable platform solutions.
- Proven ability to collaborate across teams and influence technical direction.