Salary
💰 $201,002 - $335,004 per year
Tech Stack
AnsibleAzureCloudDockerGoGrafanaKubernetesPrometheusPythonTerraform
About the role
- Design robust, scalable, highly available global cloud architectures that meet business requirements and align with industry best practices.
- Oversee the design, implementation and optimization of CI/CD pipelines to facilitate automated testing, deployment and monitoring of applications and infrastructure (GitHub Actions).
- Implement and manage Azure Kubernetes Service (AKS), Helm, and Docker-based container platforms.
- Automate infrastructure provisioning with tools like Terraform, Ansible, or similar IaC frameworks.
- Establish best practices for observability, logging, alerting and monitoring using tools like Prometheus and Grafana.
- Ensure networking, security, and compliance requirements are met across cloud environments.
- Champion collaboration with teams across the company to identify DevOps needs and deliver solutions that improve developer experience and release velocity.
- Provide escalation assistance to resolve technology stack related issues.
- Coach and mentor team members on best practices and standards in DevOps.
- Stay familiar with new industry trends in DevOps tools and concepts and perform additional duties as necessary.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field; Master’s degree preferred.
- 10+ years of experience in DevOps or related fields, with at least 5 years in an architecture role.
- Extensive experience with Microsoft Azure Stack, including Azure Kubernetes Service (AKS), and other Azure cloud services.
- Have extensive hands-on experience with Azure, Kubernetes (AKS), Helm, and Docker containerization.
- Experience managing and transitioning from monolithic to scalable, distributed architectures across multiple regions.
- Excellent understanding of CI/CD pipelines (GitHub) and experience overseeing/implementing CI/CD (GitHub Actions).
- Experience with infrastructure as code (IaC) and automation tools such as Terraform, Ansible, or equivalent.
- Experience establishing observability, logging, alerting and monitoring using tools like Prometheus and Grafana.
- Solid understanding of networking, security, and compliance in a global cloud environment.
- Background in compliance-heavy industries and integrating audit requirements with security standards (SOC2, ISO 27001, etc.).
- Experienced with scripting and automation (Python, Bash, Go, or similar).
- Exceptional problem-solving skills and ability to manage complex projects with multiple stakeholders.
- Excellent communication and interpersonal skills.