
Senior Deep Learning Solution Architect
NVIDIA
full-time
Location Type: Office
Location: Beijing • China
About the role
- Contribute to the development of open-source inference frameworks such as SGLang and vLLM
- Develop and optimize KV cache offloading frameworks for LLM workloads
- Drive R&D on compute performance in distributed training
- Study computational challenges in machine learning systems
Requirements
- More than 5 years of work experience in the technology industry
- Master’s degree or above in computer science, mathematics, electrical engineering, automation, or related fields
- Strong interest in accelerated computing, parallel computing, and heterogeneous computing
- Solid programming skills and a sound understanding of data structures and computer systems fundamentals
- Strong learning agility and adaptability
Benefits
- Competitive salaries
- Generous benefits package
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
programming, data structures, computer systems fundamentals, inference frameworks, KV cache offloading, distributed training, machine learning systems, accelerated computing, parallel computing, heterogeneous computing
Soft Skills
learning agility, adaptability
Certifications
Master’s degree