
Lead Associate Principal, Cloud Engineering
OCC
full-time
Posted on:
Location Type: Hybrid
Location: Dallas • Illinois • Texas • United States
Visit company websiteExplore more
Salary
💰 $143,300 - $228,200 per year
Job Level
Tech Stack
About the role
- Perform a range of activities required to maintain and continuously automate a large, complex Kubernetes-based cloud computing environment.
- Provide technical guidance to the team and serve as a technical liaison between internal departments.
- Utilize best practices for managing, architecting, configuring, ensuring high availability, disaster recovery, administration, and automation of Kubernetes clusters and containerized workloads with cloud-native technologies.
- Drive the creation of new infrastructure and environments critical to growth and adoption of Kubernetes/container orchestration goals across the business.
- Design, configure, implement and manage Kubernetes clusters, maintain a fully automated workflow for provisioning and managing a complex, highly available container orchestration environment using infrastructure as code.
- Develop and maintain Kubernetes operators, controllers, and custom resources to extend cluster functionality and automate application lifecycle management.
- Manage DevOps development activities and complex tasks with tools like Docker, Kafka, and Kubernetes ecosystem tools.
- Lead and participate in Kubernetes cluster build-outs, upgrades, software installation, maintenance and support including patches, security fixes, end-of-life preparation, and version upgrades.
- Implement and manage Kubernetes networking solutions, service mesh architectures, security policies, and RBAC configurations.
- Ensure reliability of Kubernetes platforms and containerized services, meeting internal and external quality standards.
- Assess and plan for capacity needs within Kubernetes clusters and the underlying cloud platform.
- Write and maintain documentation of relevant Kubernetes architectures, systems, procedures, and processes.
Requirements
- Good consultative, communication, team player and analytical skills are a must
- Working knowledge of Kubernetes architecture, container orchestration, and cloud-native infrastructure design and components, such as: etcd, networking, storage, and container runtimes
- Extensive hands-on experience with Kubernetes cluster creation, maintenance, support, and administration in production environments
- Deep understanding and practical implementation experience with Kubernetes networking (CNI plugins, service types, ingress controllers), runtime security (Pod Security Standards, OPA/Gatekeeper, network policies), and Role-Based Access Control (RBAC)
- Experience with architecting, implementing and maintaining highly available mission critical Kubernetes environments for 24/7 availability
- Experience working in an environment with a defined production change control process
- Demonstrates history of working within deadlines and ability to work well under pressure
- Production-level hands-on experience with AWS cloud services and implementing Kubernetes on AWS (EKS or self-managed clusters)
- Extensive experience with Infrastructure as Code using Terraform for provisioning and managing cloud infrastructure and Kubernetes resources
- Strong hands-on development skills with demonstrable coding experience in Go or Python (Go strongly preferred for Kubernetes operator/controller development). Candidates must be able to provide specific examples of production code they have written.
- Hands-on experience with Kubernetes ecosystem tools including: Helm, kubectl, container runtimes (containerd, CRI-O), and monitoring/observability tools
- Experience with CI/CD tools such as Jenkins, GitLab CI, or GitHub Actions.
- Experience with version control using GitHub or similar platforms
- Experience with configuration management tools such as Ansible, Puppet, or Chef
- Hands-on experience with Kubernetes operator/controller development using operator frameworks (Kubebuilder, Operator SDK, or similar). This can be demonstrated through either contributions to open-source Cloud Native Computing Foundation (CNCF) projects, OR Development of in-house Kubernetes operators/controllers. Note: If you have contributed to open-source CNCF projects, please include your GitHub profile link or links to notable Pull Requests in your resume.
- Experience with Rancher and RKE2 (Rancher Kubernetes Engine 2) Kubernetes distribution
- Experience with service mesh technologies (Istio, Linkerd) and Envoy proxy configuration and management
- Experience designing and implementing multi-tenancy architectures in Kubernetes environments
- Experience with GitOps-based continuous deployment using FluxCD, ArgoCD, or Rancher Fleet
- Experience with Kafka and event-driven architectures
- CKA, CKS certifications strongly desired
- AWS Solutions Architect Associate Certification or higher
- Relevant industry certifications such as Microsoft Azure or Google Cloud Platform
- Bachelor's degree, preferably in a technical discipline (Computer Science, Mathematics, Engineering, etc.), or equivalent combination of education and experience required
- 7+ years’ experience in IT systems installation, operations, administration, and maintenance of cloud systems / virtualized servers, with demonstrated significant experience in Kubernetes and container orchestration platforms
Benefits
- A hybrid work environment, up to 2 days per week of remote work
- Tuition Reimbursement to support your continued education
- Student Loan Repayment Assistance
- Technology Stipend allowing you to use the device of your choice to connect to our network while working remotely
- Generous PTO and Parental leave
- 401k Employer Match
- Competitive health benefits including medical, dental and vision
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Kubernetescontainer orchestrationAWSTerraformGoPythonCI/CDGitAnsibleKafka
Soft skills
consultative skillscommunicationteam playeranalytical skillsability to work under pressuredeadline management
Certifications
CKACKSAWS Solutions Architect AssociateMicrosoft AzureGoogle Cloud Platform