Tech Stack
Ansible, AWS, Cloud, Flux, Google Cloud Platform, Grafana, Jenkins, Kubernetes, OpenStack, Prometheus, Python, Terraform
About the role
- Design and implement scalable, secure, and resilient cloud infrastructure using Kubernetes, Cloud Run, and AWS ECS
- Shape cloud strategy to ensure high availability, performance, and security across infrastructure
- Lead the architecture and deployment of containerized applications and services
- Develop and maintain CI/CD pipelines to support automated deployment and testing
- Monitor system performance, troubleshoot issues, and ensure system reliability and uptime
- Collaborate with Application Support, Security, and Application teams to align infrastructure with business goals
- Document infrastructure designs, processes, and procedures
- Mentor junior engineers and provide technical leadership across projects
Requirements
- 5+ years of experience in systems engineering or cloud infrastructure roles
- Deep expertise in Kubernetes (deployment, scaling, monitoring, and troubleshooting)
- Strong hands-on experience with OpenStack (compute, storage, networking)
- Proven experience designing and managing large-scale cloud environments
- Proficiency in scripting languages (e.g., Bash, Python)
- Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)
- Solid understanding of networking, security, and system administration in cloud environments
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack)
- Kubernetes (CKA/CKAD), GCP, or AWS certifications (preferred)
- Experience with hybrid cloud or multi-cloud environments (preferred)
- Familiarity with GitOps practices and tools like Bitbucket Pipelines, Jenkins, Argo CD, or Flux (preferred)
- Background in infrastructure cost optimization and capacity planning (preferred)