FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Kubernetes Platform Engineer – AI Infrastructure
CiscoKubernetes Platform Engineer designing, building, and operating on-prem software for AI/ML workloads. Collaborating with data scientists and ML engineers to leverage Kubernetes technology.
Posted 5/15/2026full-timeRTP • North Carolina, Texas • 🇺🇸 United StatesMid-LevelSenior💰 $126,500 - $182,000 per yearWebsite
Tech Stack
Tools & technologiesDistributed SystemsGoKubernetesOpenShiftPython
About the role
Key responsibilities & impact- Design, build, and operate large-scale on-prem Kubernetes platforms (OpenShift/Anthos)
- Architect scalable, multi-tenant platform infrastructure as the foundation for AI/ML and GenAI workloads
- Enable and optimize AI/ML workloads, including GPU-based environments for training, inference, and model deployment
- Partner with data scientists and ML engineers to onboard and scale ML pipelines and workflows
- Build platform capabilities using Kubernetes controllers, operators, CRDs, and Golang/Python services
- Implement Infrastructure as Code, automation, and AIOps-driven self-healing using platform telemetry and observability
- Ensure reliability through performance tuning (scheduling, resource utilization) and participate in on-call support and incident response
Requirements
What you’ll need- 5+ years of software engineering experience, including supporting AI/ML or GPU-based workloads on Kubernetes platforms
- 3+ years operating Kubernetes in production with control plane ownership, preferably in on-prem or self-managed environments
- Strong experience with etcd management (backup, restore, recovery) and Kubernetes cluster upgrades
- Proficiency in Go with experience building Kubernetes controllers/operators, CRDs, and webhooks
- Deep understanding of Kubernetes internals (API server, scheduler, controller loops, reconciliation patterns)
- Proven ability to debug and operate large-scale distributed systems in production environments, including participation in on-call rotations
Benefits
Comp & perks- medical, dental and vision insurance
- 401(k) plan with a Cisco matching contribution
- paid parental leave
- short and long-term disability coverage
- basic life insurance
- 10 paid holidays per full calendar year
- 1 floating holiday for non-exempt employees
- 1 paid day off for employee’s birthday
- paid year-end holiday shutdown
- 4 paid days off for personal wellness
- 16 days of paid vacation time per full calendar year (non-exempt)
- flexible vacation time off program (exempt)
- 80 hours of sick time off provided on hire date and each January 1st thereafter
- additional paid time away may be requested
- optional 10 paid days per full calendar year to volunteer
- employees eligible to earn annual bonuses
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
KubernetesOpenShiftAnthosAI/MLGolangPythonInfrastructure as CodeAIOpsetcd managementdistributed systems
Soft Skills
collaborationproblem-solvingdebuggingincident responseperformance tuning