OCC

Lead Associate Principal, Cloud Engineering

full-time

Location Type: Hybrid

Location: Dallas, Illinois, Texas, United States

Salary

💰 $143,300 - $228,200 per year

About the role

  • Perform a range of activities required to maintain and continuously automate a large, complex Kubernetes-based cloud computing environment.
  • Provide technical guidance to the team and serve as a technical liaison between internal departments.
  • Apply best practices for architecting, configuring, managing, administering, and automating Kubernetes clusters and containerized workloads with cloud-native technologies, including high availability and disaster recovery.
  • Drive the creation of new infrastructure and environments critical to growth and adoption of Kubernetes/container orchestration goals across the business.
  • Design, configure, implement, and manage Kubernetes clusters, and maintain a fully automated workflow for provisioning and managing a complex, highly available container orchestration environment using infrastructure as code.
  • Develop and maintain Kubernetes operators, controllers, and custom resources to extend cluster functionality and automate application lifecycle management.
  • Manage DevOps development activities and complex tasks with tools like Docker, Kafka, and Kubernetes ecosystem tools.
  • Lead and participate in Kubernetes cluster build-outs, upgrades, software installation, maintenance and support including patches, security fixes, end-of-life preparation, and version upgrades.
  • Implement and manage Kubernetes networking solutions, service mesh architectures, security policies, and RBAC configurations.
  • Ensure reliability of Kubernetes platforms and containerized services, meeting internal and external quality standards.
  • Assess and plan for capacity needs within Kubernetes clusters and the underlying cloud platform.
  • Write and maintain documentation of relevant Kubernetes architectures, systems, procedures, and processes.

Requirements

  • Strong consultative, communication, teamwork, and analytical skills are a must
  • Working knowledge of Kubernetes architecture, container orchestration, and cloud-native infrastructure design and components, such as: etcd, networking, storage, and container runtimes
  • Extensive hands-on experience with Kubernetes cluster creation, maintenance, support, and administration in production environments
  • Deep understanding and practical implementation experience with Kubernetes networking (CNI plugins, service types, ingress controllers), runtime security (Pod Security Standards, OPA/Gatekeeper, network policies), and Role-Based Access Control (RBAC)
  • Experience with architecting, implementing and maintaining highly available mission critical Kubernetes environments for 24/7 availability
  • Experience working in an environment with a defined production change control process
  • Demonstrated history of meeting deadlines and ability to work well under pressure
  • Production-level hands-on experience with AWS cloud services and implementing Kubernetes on AWS (EKS or self-managed clusters)
  • Extensive experience with Infrastructure as Code using Terraform for provisioning and managing cloud infrastructure and Kubernetes resources
  • Strong hands-on development skills with demonstrable coding experience in Go or Python (Go strongly preferred for Kubernetes operator/controller development). Candidates must be able to provide specific examples of production code they have written.
  • Hands-on experience with Kubernetes ecosystem tools including: Helm, kubectl, container runtimes (containerd, CRI-O), and monitoring/observability tools
  • Experience with CI/CD tools such as Jenkins, GitLab CI, or GitHub Actions.
  • Experience with version control using GitHub or similar platforms
  • Experience with configuration management tools such as Ansible, Puppet, or Chef
  • Hands-on experience with Kubernetes operator/controller development using operator frameworks (Kubebuilder, Operator SDK, or similar). This can be demonstrated through either contributions to open-source Cloud Native Computing Foundation (CNCF) projects or development of in-house Kubernetes operators/controllers. Note: If you have contributed to open-source CNCF projects, please include your GitHub profile link or links to notable pull requests in your resume.
  • Experience with Rancher and RKE2 (Rancher Kubernetes Engine 2) Kubernetes distribution
  • Experience with service mesh technologies (Istio, Linkerd) and Envoy proxy configuration and management
  • Experience designing and implementing multi-tenancy architectures in Kubernetes environments
  • Experience with GitOps-based continuous deployment using FluxCD, ArgoCD, or Rancher Fleet
  • Experience with Kafka and event-driven architectures
  • CKA and CKS certifications strongly desired
  • AWS Solutions Architect Associate Certification or higher
  • Relevant industry certifications such as Microsoft Azure or Google Cloud Platform
  • Bachelor's degree, preferably in a technical discipline (Computer Science, Mathematics, Engineering, etc.), or equivalent combination of education and experience required
  • 7+ years’ experience in IT systems installation, operations, administration, and maintenance of cloud systems / virtualized servers, with demonstrated significant experience in Kubernetes and container orchestration platforms

Benefits

  • A hybrid work environment, up to 2 days per week of remote work
  • Tuition Reimbursement to support your continued education
  • Student Loan Repayment Assistance
  • Technology Stipend allowing you to use the device of your choice to connect to our network while working remotely
  • Generous PTO and Parental leave
  • 401k Employer Match
  • Competitive health benefits including medical, dental and vision

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Kubernetes, container orchestration, AWS, Terraform, Go, Python, CI/CD, Git, Ansible, Kafka
Soft skills
consultative skills, communication, team player, analytical skills, ability to work under pressure, deadline management
Certifications
CKA, CKS, AWS Solutions Architect Associate, Microsoft Azure, Google Cloud Platform