NVIDIA

Senior Deep Learning Solution Architect

NVIDIA

full-time

Posted on:

Location Type: Office

Location: BeijingChina

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Contribute to the development of open-source inference frameworks such as SGLang and vLLM
  • Develop and optimize KV cache offloading frameworks for LLM workloads
  • Drive R&D on compute performance in distributed training
  • Study computational challenges in machine learning systems

Requirements

  • Over 5 years working experience in the technology industry
  • Master’s degree or above in computer science, mathematics, electrical engineering, automation, or related fields
  • Strong interest in accelerated computing, parallel computing, and heterogeneous computing
  • Solid programming skills with understanding of data structures and computer systems fundamentals
  • Strong learning agility and adaptability
Benefits
  • Competitive salaries
  • Generous benefits package
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
programmingdata structurescomputer systems fundamentalsinference frameworksKV cache offloadingdistributed trainingmachine learning systemsaccelerated computingparallel computingheterogeneous computing
Soft Skills
learning agilityadaptability
Certifications
Master’s degree