NVIDIA

Senior AI Performance and Efficiency Engineer

full-time

Location Type: Remote

Location: California, New York, United States

Salary

💰 $152,000 - $241,500 per year

About the role

  • Collaborate closely with our AI/ML researchers to make their ML models more efficient, leading to significant productivity improvements and cost savings
  • Build tools and frameworks, and apply ML techniques, to detect and analyze efficiency bottlenecks and deliver productivity improvements for our researchers
  • Work with researchers on a variety of innovative ML workloads across robotics, autonomous vehicles, LLMs, video, and more
  • Collaborate across the engineering organizations to deliver efficiency in our usage of hardware, software, and infrastructure
  • Proactively monitor fleet-wide utilization patterns, analyze existing inefficiency patterns or discover new ones, and deliver scalable solutions that address them
  • Keep up to date with the most recent developments in AI/ML technologies, frameworks, and successful strategies, and advocate for their integration within the organization

Requirements

  • BS or similar background in Computer Science or related area (or equivalent experience)
  • 5+ years of experience designing and operating large-scale compute infrastructure
  • Strong understanding of modern ML techniques and tools
  • Experience investigating and resolving training and inference performance issues end to end
  • Debugging and optimization experience with Nsight Systems and Nsight Compute
  • Experience debugging large-scale distributed training using NCCL
  • Proficiency in programming and scripting languages such as Python, Go, and Bash
  • Familiarity with cloud computing platforms (e.g., AWS, GCP, Azure)
  • Dedication to ongoing learning and staying updated on new technologies and innovative methods in the AI/ML infrastructure sector
  • Excellent communication and collaboration skills, with the ability to work effectively with teams and individuals of different backgrounds

Benefits

  • Competitive salaries
  • Comprehensive benefits package
  • Equity

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
machine learning techniques, large scale compute infrastructure, debugging, optimization, distributed training, programming languages, scripting languages, performance analysis, efficiency analysis, scalable solutions
Soft Skills
communication skills, collaboration skills, problem-solving, teamwork, adaptability, dedication to learning, interpersonal skills, analytical thinking, proactive monitoring, advocacy for integration
Certifications
BS in Computer Science, equivalent experience