FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Senior Software Engineer – CUTLASS Performance
NVIDIASenior Software Engineer focusing on CUTLASS performance to enhance NVIDIA's AI infrastructure capabilities. Engaging with cross-functional teams to benchmark and optimize deep learning performance.
Posted 6/1/2026full-timeSanta Clara • California, North Carolina, Oregon, Texas, Washington • 🇺🇸 United StatesSenior💰 $152,000 - $287,500 per yearWebsite
Tech Stack
Tools & technologiesPython
About the role
Key responsibilities & impact- Benchmark the performance of state-of-the-art deep learning models’ inference and training passes to identify key GPU kernel and fusion opportunities.
- Identify gaps between theoretical and realized performance, and suggest software improvements or model adjustments to resolve them.
- Develop tooling to automate the benchmarking, analysis, and performance optimization loop to push the limit of CUTLASS kernel performance within DL networks.
- Be the authoritative resource on kernel performance in the team and engage with teams across NVIDIA including GPU architecture, DL frameworks, and QA as the performance representative for the CUTLASS team.
Requirements
What you’ll need- Masters or PhD degree in Computer Science, Computer Engineering, or related field (or equivalent experience).
- 3+ years of relevant industry experience.
- Strong programming skills in Python and C++.
- Experience in software performance analysis and optimization.
- Deep understanding of computer architecture and familiarity with GPUs or similar parallel processing architectures.
Benefits
Comp & perks- equity
- benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonC++deep learningperformance analysisperformance optimizationGPU architectureparallel processingbenchmarkingCUTLASSmodel adjustments