FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Systems Performance Engineer – Agentic AI Workloads
NVIDIADeep Learning Architect for agentic AI workloads at NVIDIA. Collaborating on AI infrastructure and performing system traffic modeling and simulations.
Posted 6/4/2026full-timeCalifornia, North Carolina, Oregon • 🇺🇸 United StatesMid-LevelSenior💰 $124,000 - $195,500 per yearWebsite
Tech Stack
Tools & technologiesErlangPython
About the role
Key responsibilities & impact- Develop and extend C++ and Python simulators that model system-level network and compute traffic for agentic LLM workloads in datacenter environments
- Characterize real-world LLM serving workloads and distill them into representative simulator inputs
- Run simulations at scale and apply statistical techniques to post-process and interpret results
- Identify performance bottlenecks and translate findings into concrete architectural recommendations
- Collaborate with hardware, software, and research teams to influence the design of future AI systems
Requirements
What you’ll need- Pursuing or recently completed a MS, or PhD in CS, EE, Mathematics, or a related field (or equivalent experience)
- Strong programming skills in C++ and Python
- Solid foundations in queueing theory and traffic modeling (e.g., Erlang models, Little's Law)
- Strong statistics background: characterize huge datasets with percentiles, distributions, and clustering techniques such as K-means
- Understanding of deep learning fundamentals, LLMs, and modern inference serving frameworks
Benefits
Comp & perks- equity
- benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
C++Pythonqueueing theorytraffic modelingErlang modelsLittle's LawstatisticsK-means clusteringdeep learningLLMs