NVIDIA

Senior Performance Compiler Engineer – Triton

Senior Performance Compiler Engineer working on the open-source Triton compiler project, investigating GPU architectures and designing compiler technology for efficient AI applications.

Posted 5/8/2026 • Full-time • Remote • California, Texas, Washington • 🇺🇸 United States • Senior • 💰 $184,000 - $287,500 per year

Tech Stack

Tools & technologies
Assembly, Python

About the role

Key responsibilities & impact
  • Investigating the latest and future NVIDIA GPU hardware architecture and programming models.
  • Working on the frontier of AI by understanding advanced algorithms (like attention sinks and MoEs) and numerics (like block-scaled floating point) to identify new opportunities for optimization.
  • Designing and implementing compiler technology using MLIR to optimize high-level kernel descriptions (written in Triton's Python DSL), with a focus on generating efficient, low-level GPU code.
  • Engaging in a dynamic, iterative process of optimization—sometimes starting with the kernel, sometimes with the compiler—to find the most efficient path to peak performance.
  • Collaborating with teams across NVIDIA, including hardware architects and the CUDA compiler team, to influence future products and ensure we are always operating at maximum efficiency.

Requirements

What you’ll need
  • Bachelor's, Master's, or Ph.D. degree, or equivalent experience, in Computer Science, Computer Engineering, Applied Math, or a related field.
  • 8+ years of relevant industry experience in software development.
  • Demonstrated strong C++ programming and software design skills, with an emphasis on performance analysis and debugging.
  • Experienced in parallel programming, including CUDA/OpenCL GPU programming or other parallel models such as OpenMP.
  • Solid understanding of computer architecture and hands-on experience with assembly-level programming.

Benefits

Comp & perks
  • equity
  • health insurance

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
C++, MLIR, Triton, CUDA, OpenCL, OpenMP, performance analysis, debugging, assembly-level programming, computer architecture
Soft Skills
collaboration, communication, problem-solving, adaptability, teamwork, analytical thinking, creativity, initiative, attention to detail, iterative process