Kubernetes Operations: Manage and scale multi-cluster Kubernetes deployments, ensuring high availability, performance, and reliability.
Traffic Management: Design and implement traffic strategies (e.g., canary releases, blue/green deployments, A/B testing, gradual rollouts) using Istio/Envoy or similar service mesh technologies.
Release Engineering: Build and maintain CI/CD pipelines, automate deployments and rollbacks, and improve release efficiency and reliability.
Infrastructure as Code (IaC): Use Terraform and other IaC tools to provision and manage cloud infrastructure, ensuring consistency and auditability.
Observability & Incident Response: Establish monitoring, logging, and tracing solutions; troubleshoot and resolve production issues quickly to maintain system stability.
Documentation & Knowledge Sharing: Write and maintain clear technical documentation (system architecture, release processes, traffic policies, runbooks, best practices) to enable effective onboarding and collaboration.
Cross-Team Collaboration: Partner with developers, SREs, and platform teams to design scalable release and traffic strategies, and drive adoption of engineering best practices.
Requirements
5+ years of experience with IaC on a cloud provider (AWS)
3+ years of experience with production Kubernetes clusters
Hands-on experience managing Kubernetes in production environments.
Strong understanding of service mesh technologies (Istio, Envoy, or similar).
Expertise in CI/CD workflows and tools such as ArgoCD, FluxCD, GitHub Actions, or Jenkins.
Solid foundation in Linux, networking, and containerization.
Strong technical writing skills, with the ability to produce clear, structured documentation for both technical and non-technical audiences.
Strong problem-solving skills, with proven experience in high-pressure incident response.
Excellent communication and collaboration skills, with a mindset for driving engineering efficiency and quality.
Benefits
Annual discretionary bonus
Long-term incentive plan
Medical/dental/vision insurance
401(k) plan
Paid time off
Flexible time off policy
Generous parental leave program
Monthly wellness reimbursement