NVIDIA

Manager, System Software – Triton Inference Server

full-time

Origin: 🇺🇸 United States • California

Salary

💰 $224,000 - $356,500 per year

Job Level

Senior, Lead

Tech Stack

Cloud, gRPC, Python

About the role

  • Guide, mentor, and develop an inclusive and collaborative engineering team focused on delivering robust model serving solutions.
  • Drive planning, prioritization, and execution for projects that improve Triton’s scalability, performance, and reliability in non-generative AI deployments.
  • Foster partnerships with Product and Program Management to create feature roadmaps, manage cross-team dependencies, and balance project resources for both cloud and on-premises platforms.
  • Work with internal teams and external customers to understand use cases and convert their needs into product features.
  • Promote engineering excellence through modern, agile development practices and a culture of quality and accountability.

Requirements

  • Master’s or PhD, or equivalent experience, in Computer Science, Computer Engineering, or a related field.
  • Eight or more years of overall hands-on software development experience in customer-facing environments.
  • At least three years building, mentoring, and leading software engineering teams delivering production-grade solutions.
  • Deep background in scalable serving architectures, with direct experience building cloud-native inference APIs, REST/gRPC/protobuf-based services, or similar technologies.
  • Advanced C/C++ and Python development skills, demonstrating clean, object-oriented design, as well as proficiency in debugging, performance optimization, and testing.
  • Track record of contributing to or leading large open-source projects, including using GitHub for code reviews, bug tracking, and release management.
  • Strong knowledge of agile methodologies and tools such as JIRA and Linear.
  • Ability to communicate technical topics with clarity and empathy to colleagues, partners, and diverse audiences.