
Senior AI Performance and Efficiency Engineer
NVIDIA
full-time
Posted on:
Location Type: Remote
Location: California • New York • United States
Visit company websiteExplore more
Salary
💰 $152,000 - $241,500 per year
Job Level
About the role
- Collaborate closely with our AI/ML researchers to make their ML models more efficient leading to significant productivity improvements and cost savings
- Build tools, frameworks, and apply ML techniques to detect & analyze efficiency bottlenecks and deliver productivity improvements for our researchers
- Work with researchers working on a variety of innovative ML workloads across Robotics, Autonomous vehicles, LLM’s, Videos and more
- Collaborate across the engineering organizations to deliver efficiency in our usage of hardware, software, and infrastructure
- Proactively monitor fleet wide utilization patterns, analyze existing inefficiency patterns, or discover new patterns, and deliver scalable solutions to solve them
- Keep up to date with the most recent developments in AI/ML technologies, frameworks, and successful strategies, and advocate for their integration within the organization.
Requirements
- BS or similar background in Computer Science or related area (or equivalent experience)
- Minimum 5+ years of experience designing and operating large scale compute infrastructure
- Strong understanding of modern ML techniques and tools
- Experience investigating, and resolving, training & inference performance end to end
- Debugging and optimization experience with NSight Systems and NSight Compute
- Experience with debugging large-scale distributed training using NCCL
- Proficiency in programming & scripting languages such as Python, Go, Bash
- Familiarity with cloud computing platforms (e.g., AWS, GCP, Azure)
- Dedication to ongoing learning and staying updated on new technologies and innovative methods in the AI/ML infrastructure sector
- Excellent communication and collaboration skills, with the ability to work effectively with teams and individuals of different backgrounds
Benefits
- Competitive salaries
- Comprehensive benefits package
- Equity
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
machine learning techniqueslarge scale compute infrastructuredebuggingoptimizationdistributed trainingprogramming languagesscripting languagesperformance analysisefficiency analysisscalable solutions
Soft Skills
communication skillscollaboration skillsproblem-solvingteamworkadaptabilitydedication to learninginterpersonal skillsanalytical thinkingproactive monitoringadvocacy for integration
Certifications
BS in Computer Scienceequivalent experience