Architect and deploy OpenStack and Kubernetes clusters designed for GPU scheduling, high performance, and multi-tenant workloads.
Automate deployment pipelines for cloud infrastructure using Terraform, Ansible, Helm, and Kubernetes Operators.
Build and manage GPU-ready container runtimes, NVIDIA device plugins, and Kubernetes-native GPU provisioning frameworks.
Ensure high availability and performance of OpenStack and Kubernetes clusters using tools such as Prometheus, Grafana, Loki, and Thanos.
Implement secure namespace isolation, RBAC, and network policies across OpenStack and Kubernetes layers.
Work cross-functionally with DevOps, AI, Support, and Product teams to align infrastructure services with platform goals. Provide guidance on Kubernetes best practices.
Requirements
5+ years of experience with OpenStack in production environments.
3+ years of experience managing production-grade Kubernetes clusters, including bare-metal or private cloud environments.
Strong hands-on expertise with:
Kubernetes operators, Helm, and custom resource definitions (CRDs)
GPU orchestration in Kubernetes using NVIDIA tools
Multi-cluster or federated Kubernetes
Proficiency in Linux, Ceph, networking (Calico/Cilium), and infrastructure scripting (Python, Bash).
Strong knowledge of cloud-native security, policy frameworks, and service meshes.
Experience with CI/CD pipelines, GitOps, and infrastructure-as-code tooling (Terraform, Ansible, ArgoCD).
Benefits
Competitive salary
100% home-office
Full-time permanent contract.
Opportunity to work with a diverse team of talented professionals who are passionate about technology and innovation.
A collaborative and supportive work environment that encourages professional growth and development.
Exposure to cutting-edge technologies and the opportunity to make a significant impact on the future of cloud computing.
Possibility to participate on international events
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.