Significantly contribute to the evolution of Nebari (https://nebari.dev) and design reusable, modular infrastructure components that can be composed into bespoke Kubernetes-based platforms for sovereign AI deployments
Develop composable MLOps components and infrastructure patterns supporting model training, serving, monitoring, and CI/CD pipelines that organizations can own and operate
Design and implement observability, monitoring, and cost optimization strategies for large-scale AI/ML workloads on client-owned Kubernetes infrastructure
Collaborate with ML engineers to optimize infrastructure for training ML models, quantizing and packaging open weight LLMs, computer vision workloads, and other AI applications in sovereign environments
Contribute to open-source MLOps tooling and Kubernetes ecosystem projects that enable data sovereignty
Work with clients to deploy, configure, and optimize their sovereign AI infrastructure
Collaborate with a fully remote distributed team using asynchronous communication methods
Requirements
4+ years of hands-on infrastructure/platform/DevOps experience with production systems
Strong understanding of infrastructure engineering principles: scalability, reliability, observability, and automation
Solid experience with Kubernetes in production environments, including troubleshooting and optimization
Proficiency with Infrastructure-as-Code tooling (Terraform, Helm, or similar) for managing complex deployments
Experience with at least one major cloud platform (AWS, Azure, GCP) including networking, security, and compute services
Strong programming skills, particularly in Python and/or Go, with ability to write maintainable infrastructure code
Experience contributing to technical initiatives or mentoring junior team members
Understanding of CI/CD practices, GitOps workflows, and infrastructure automation principles
Comfortable working independently and in distributed teams
Ability to provide and constructively receive feedback
Available for collaboration during overlap with US Central Time zone
Benefits
Medical, Dental & Vision – 100% paid for employees, 75% for dependents
401(k) Match – Up to 5% with full vesting after 2 years
Unlimited PTO – With a required minimum of 15 days off annually
Fully Remote Setup – Includes up to $3,000 equipment reimbursement
Continuous Education – Includes up to $500 reimbursement
Disability & Life Insurance – 100% employer-paid
HSA & FSA Options – With monthly HSA contributions from OpenTeams
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.