NVIDIA

Senior Software Engineer, AI and DL Kernel Libraries

NVIDIA

full-time

Posted on:

Location Type: Remote

Location: CaliforniaColoradoUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $184,000 - $287,500 per year

Job Level

About the role

  • Innovating and developing new AI systems technologies for efficient inference
  • Designing, implementing, and optimizing kernels for high impact AI workloads
  • Designing and implementing extensible abstractions for LLM serving engines
  • Building efficient just-in-time domain specific compilers and runtimes
  • Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
  • Contributing to open source communities like FlashInfer, vLLM, and SGLang

Requirements

  • Masters degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); PhD are preferred
  • 6+ years (academic/ industry) experience with ML/DL systems development preferable
  • Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX, etc) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC.
  • Strong Python and C/C++ programming skills
  • Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)
Benefits
  • equity
  • benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AI systems technologieskernels optimizationLLM serving enginesdomain specific compilersjust-in-time compilationdeep learning frameworksPythonC/C++GPU kernel developmentperformance optimization
Soft Skills
collaboration
Certifications
Masters degree in Computer ScienceMasters degree in Electrical EngineeringPhD