Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
NVIDIA

Principal Deep Learning Communication Architect

NVIDIA

NVIDIA seeks a Principal Deep Learning Communication Architect to lead communication libraries and AI development across next-gen platforms. Join a team of top talent in a transformative AI environment.

Posted 4/14/2026full-timeRemote • California, Texas • 🇺🇸 United StatesLead💰 $272,000 - $431,250 per yearWebsite

About the role

Key responsibilities & impact
  • Define the long-term technical roadmap for communication libraries across NVIDIA’s next-generation platforms
  • Lead the development of next-generation communication primitives and collective algorithms
  • Partner with application developers to architect and implement specialized communication primitives
  • Collaborate with silicon architects and software engineers to influence hardware specifications for next-generation networking
  • Develop high-fidelity analytical models and simulators to predict system behavior under emerging workloads

Requirements

What you’ll need
  • Ph.D. or M.S. in Computer Science, Electrical Engineering, or a related field (or equivalent experience)
  • 12+ years of industry experience in high-performance computing (HPC) or distributed deep learning
  • Deep understanding of 3D parallelism (Data, Tensor, Pipeline) and advanced strategies including Context Parallelism, Expert Parallelism, and Zero Redundancy Optimizer (ZeRO) variants
  • Deep technical proficiency with NCCL, UCX, UCC, NVSHMEM, or MPI
  • Experience with RDMA, RoCE, and low-level InfiniBand verbs
  • Advanced knowledge of high-throughput inference engines and schedulers, specifically TensorRT-LLM, vLLM, SGLang, and NVIDIA Dynamo
  • Expert knowledge of the NVIDIA GPU memory hierarchy (HBM3e/HBM4, L2 cache)

Benefits

Comp & perks
  • equity
  • benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
high-performance computingdistributed deep learning3D parallelismContext ParallelismExpert ParallelismZero Redundancy OptimizerNCCLUCXUCCNVSHMEM
Soft Skills
leadershipcollaborationcommunication