Salary
💰 $148,000 - $287,500 per year
Tech Stack
AnsibleCloudDockerKubernetesLinuxOpen SourcePython
About the role
- Our day-to-day work involves guiding partners in their adoption of end-to-end Agentic AI solutions, using NVIDIA's compute, networking, and software stacks.
- Don't think this is a high-level slideshow job - we are the voice of experience, using cloud native methodologies, low latency networks, and accelerated compute to help build modern AI factories.
- We also excel at sharing knowledge with others, whether it's delivering demos, assisting with proof-of-concepts, or writing papers and developer blogs.
- By collaborating with executives and engineering, we solve complex problems and help bring NVIDIA's premiere technologies to life in the cloud and in the datacenter.
- Our mission is to solve the problems that nobody else has solved yet, and we need someone to be an instrumental part of that!
Requirements
- BS, MS, or PhD in Engineering, Computer Science, or a related field (or equivalent experience).
- Established track record working with AI and HPC clusters, both on-premises and cloud based.
- 4 plus years of proven experience with cluster management and related tools, including Docker Containers, Slurm, Kubernetes, and Ansible.
- Hands-on experience with network, storage, cluster configuration and debugging.
- Strong analytical and problem-solving skills, along with an ability to articulate what you know to others.
- Ability to multitask efficiently in a dynamic environment.