Operate and support Kubernetes clusters (on-prem and cloud) including installation, upgrades, monitoring, and tuning.
Design, maintain, and optimize infrastructure pipelines using tools like Ansible, Terraform, Jenkins, GitLab CI, or similar.
Support deployments across AWS, Azure, and/or OCI including VM provisioning, networking, identity access, and storage configuration.
Troubleshoot issues across application stacks, Linux, and Windows systems—identifying root causes in complex environments involving containers, services, and hybrid infrastructure.
Independently investigate ambiguous technical problems, develop conceptual solutions, and drive implementation to closure without heavy oversight.
Assist with patching, access controls, and secure configuration of systems in accordance with NIST, STIGs, or internal policy requirements.
Write operational documentation, runbooks, and contribute to knowledge-sharing within the team.
Participate in daily standups, sprint planning, and retrospectives when applicable.
Requirements
Typically requires a University Degree or equivalent experience and minimum 10 years prior relevant experience, or an Advanced Degree in a related field and minimum 7 years experience
7+ years of hands-on experience with Kubernetes (preferably both managed and self-hosted clusters)
Experience with cloud environments (AWS, Azure, or OCI)—infrastructure provisioning, networking, IAM.
Proficient in Ansible and at least one CI/CD platform (e.g., GitLab CI, Jenkins, AWS CodeBuild, Azure DevOps).
Advanced troubleshooting and root cause analysis skills in distributed systems.
Ability to work independently in high-ambiguity environments and push projects through to resolution.
Experience with container security, observability tooling (e.g., Prometheus, Grafana), and Helm charts.
Familiarity with scripting in Bash, Python, or PowerShell.
Knowledge of infrastructure-as-code (Terraform).
Exposure to secure software development practices and compliance frameworks (FedRAMP, NIST 800-171, DoD STIGs).
Strong communication and documentation skills.
Self-starter with a bias for action and accountability.
Collaborative mindset with the ability to interface with developers, security, and product teams.
Benefits
parental (including paternal) leave
flexible work schedules
achievement awards
educational assistance
child/adult backup care
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
advanced troubleshootingroot cause analysisindependent workcommunicationdocumentationcollaborationself-starteraccountabilityproblem-solvingadaptability