
Senior GPU Scheduler Engineer
NVIDIA
full-time
Posted on:
Location Type: Office
Location: Santa Clara • California • Washington • United States
Visit company websiteExplore more
Salary
💰 $152,000 - $241,500 per year
Job Level
Tech Stack
About the role
- Design and develop new scheduling features and add-on services to improve GPU compute clusters across many dimensions, such as resource usage fairness, GPU occupancy, GPU waste, application resilience, application performance and power usage.
- Design and develop batch workload management and orchestration services
- Provide support to staff and end users to resolve batch scheduler issues
- Build and improve our ecosystem around GPU-accelerated computing
- Performance analysis and optimizations of deep learning workflows
- Develop large scale automation solutions
- Root cause analysis and suggest corrective action for problems large and small scales
- Finding and fixing problems before they occur
Requirements
- Bachelor’s degree in Computer Science, Electrical Engineering or related field or equivalent experience
- 5+ years of work experience
- Strong understanding of batch scheduling, preferably with experience in schedulers such as SLURM or K8s batch schedulers (Kueue, Volcano, etc.)
- Significant experience in systems programming languages such as C/C++ & Go as well as scripting languages such as Python and bash
- Established experience in Linux operating system, environment and tools
- Experience analyzing and tuning performance for a variety of AI workloads
- In-depth understanding of container technologies like Docker, Singularity, Podman
- Flexibility/adaptability for working in a dynamic environment with different frameworks and requirements
- Excellent communication, interpersonal and customer collaboration skills
Benefits
- equity
- benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
batch schedulingsystems programmingCC++GoPythonbashperformance analysisautomation solutionsroot cause analysis
Soft Skills
communicationinterpersonal skillscustomer collaborationflexibilityadaptability
Certifications
Bachelor’s degree in Computer ScienceBachelor’s degree in Electrical Engineering