
Senior Software Architect – Deep Learning, HPC Communications
NVIDIA
full-time
Location Type: Hybrid
Location: Santa Clara • California • Massachusetts • United States
Salary
💰 $184,000 - $287,500 per year
About the role
- Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems
- Design and implement new communication technologies to accelerate AI and HPC workloads
- Explore innovative solutions in HW and SW for our next generation platforms as part of co-design efforts involving GPU, Networking, and SW architects
- Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and drive new innovations
- Use simulation to explore the performance of large GPU clusters (think scales of hundreds of thousands of GPUs)
Requirements
- M.S./Ph.D. degree in CS/CE or equivalent experience
- 5+ years of relevant experience
- Excellent C/C++ programming and debugging skills
- Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC)
- Deep understanding of operating systems, computer and system architecture
- Solid grounding in the fundamentals of network architecture, topology, algorithms, and communication scaling relevant to AI and HPC workloads
- Strong experience with Linux
- Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment
Benefits
- equity
- benefits
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
C, C++, parallel programming, MPI, SHMEM, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC
Soft Skills
communication, flexibility, collaboration
Certifications
M.S. degree, Ph.D. degree