Salary
💰 $160,000 - $210,000 per year
Tech Stack
AnsibleAWSCloudEC2KubernetesPrometheusPythonTerraform
About the role
- Design, implement, and maintain infrastructure across datacenters and hybrid cloud deployments
- Assess physical and network architectures for scalability, reliability, and cost efficiency
- Improve monitoring, incident management, and disaster recovery (including migration to Datadog)
- Implement infrastructure as code with Terraform and Ansible, automate using Python and Bash
- Monitor and maintain shared infrastructure in AWS ensuring availability and stability
- Work closely with engineering and product teams to scope projects to core business requirements
- Lead major service management initiatives and drive long-term roadmap
Requirements
- 7+ years in operations, engineering, or SRE with expertise in multi-datacenter deployments
- Deep knowledge of AWS infrastructure, networking, and service management practices
- Skilled in infrastructure as code and automation, with proficiency in Python and Bash
- Experience with Terraform and Ansible
- Some Kubernetes experience
- Willingness to travel 1–2 times per quarter for deployments (bonus)
- Self-starter with strong communication and collaboration skills