NVIDIA

Principal Software Engineer – CSP Engagements

NVIDIA

full-time

Posted on:

Origin:  • 🇺🇸 United States • California

Visit company website
AI Apply
Manual Apply

Salary

💰 $272,000 - $425,500 per year

Job Level

Lead

Tech Stack

CloudLinux

About the role

  • Drive system software architecture alignment and technical deep dives, acting as the primary software engineering contact for NPI projects with key customers
  • Collaborate with major customers to understand their roadmap, use cases, and requirements, aligning them with NVIDIA’s roadmap
  • Spearhead cross functional efforts to resolve complex and high-profile customer issues during NPI phase
  • Make key technical decisions even when faced with ambiguity and mitigate execution risks by following left shift strategy
  • Build and maintain customer trust by understanding and addressing their needs
  • Work closely with cross-functional architects in defining system software architecture for complex server platforms

Requirements

  • Extensive experience in designing scalable, high-performance server systems at the SW/HW interface
  • Expertise in server system architecture and its impact on application performance
  • Proven leadership skills with strong project ownership in complex software and hardware environments
  • Deep understanding of computer architecture, microprocessor concepts, and expert knowledge of ARM (aarch64) and x86 architectures
  • Proficient in system software design, OS fundamentals, Linux kernel device drivers, and low-level hardware/software interfaces
  • Skilled in complex system-level debugging, performance analysis, and test design
  • BS or MS in Computer Engineering, Computer Science, or related field, or equivalent experience
  • Over 15 years in system software architecture and development
  • Knowledge of cloud and cluster level deployment and management systems
  • Expertise in Out of Band and In-band management architectures
  • Experience with GPU computing (CUDA) and deep learning workloads
  • Knowledge of Memory fabric and CXL architectures