Tech Stack: AWS, Cloud, EC2, Grafana, Kubernetes, Python, Terraform
About the role

Design, evolve, secure, and maintain scalable, resilient, and cost-efficient AWS cloud infrastructure and environments.
Implement and manage Kubernetes (EKS) clusters to support modern containerised workloads.
Build and maintain infrastructure-as-code using Terraform.
Develop and manage CI/CD pipelines using GitHub and GitHub Actions.
Implement and manage GitOps workflows with Argo CD.
Ensure observability and system health through monitoring and logging tools, including Grafana and New Relic.
Collaborate with Customer Success, Technical Support, architects, engineers, and customer-facing teams to resolve incidents within SLA objectives.
Provide clear, accurate incident updates and stakeholder communications.
Lead or contribute to process improvement initiatives and drive automation to maximise team efficiency.
Mentor and coach team members, promoting knowledge sharing and technical growth.
Maintain detailed configuration, infrastructure-as-code, and technical documentation.
Stay ahead of emerging cloud technologies, including AI/ML, and apply them to enhance platform capabilities.
Participate in the on-call rota and provide operational support as required.
Take ownership of information security and data protection responsibilities, supporting compliance with regulatory and legal requirements.

Requirements

Strong hands-on experience with AWS cloud infrastructure and services (EC2, S3, RDS, IAM, VPC, Lambda, etc.)
Proven expertise in Kubernetes and Amazon EKS (mandatory)
Proficiency with Terraform for infrastructure-as-code (mandatory)
Hands-on experience with GitHub and GitHub Actions for CI/CD automation (mandatory)
Experience with Argo CD for GitOps workflows (mandatory)
Strong knowledge of monitoring and observability tools, including Grafana and New Relic (mandatory)
Strong scripting skills (Python, Bash, or similar)
Deep understanding of cloud security principles and practices
Excellent communication, troubleshooting, and problem-solving skills
Ability to participate in an on-call rota and provide operational support
Achieve and maintain AWS certifications (highly desirable)
Exposure to AI/ML workloads and services (e.g., SageMaker, Bedrock) (highly desirable)
Experience with serverless computing and event-driven architectures (highly desirable)
Knowledge of modern data engineering patterns and multi-cloud or hybrid environments (highly desirable)

We’re on a journey to become market leaders; collaborate and learn from industry experts from all over the globe.
Work with game-changing products and services.
Get the training and support you need to try new things, adapt to quick changes and explore different paths.
Progress your career, your way; investment in development and professional growth.
An inclusive work environment that respects all dimensions of diversity.
A culture that fosters support and unbridled collaboration.
Pay and benefits reflect performance and are designed to attract top talent.
No academic qualifications required; selection based on experience and potential.
Equal opportunity employer; candidates of all backgrounds are encouraged to apply.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills: AWS, Kubernetes, EKS, Terraform, CI/CD, GitHub, GitHub Actions, Argo CD, Grafana, New Relic
Soft skills: communication, troubleshooting, problem-solving, mentoring, collaboration, process improvement, automation, knowledge sharing, incident management, stakeholder communication
Certifications: AWS certifications