NVIDIA

Senior Solutions Architect – AI Compute

NVIDIA

full-time

Posted on:

Location Type: Remote

Location: District of ColumbiaFloridaUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $184,000 - $287,500 per year

Job Level

About the role

  • Primary responsibilities will include deploying, managing, and validating AI Compute/HPC infrastructure in Linux-based environments for new and existing customers.
  • Be the domain expert with partners during planning calls through implementation.
  • Handover-related documentation and perform knowledge transfers required to support customers and partners as they begin rolling out some of the most sophisticated systems in the world!
  • Provide feedback to internal and partners teams such as opening bugs, documenting workarounds, and suggesting improvements.

Requirements

  • 8+ years providing in-depth support and deployment services; solving problems for hardware and software products.
  • Knowledge and experience with Linux system administration, process and package management, task scheduling, kernel management, boot procedures/fixing, performance reporting/optimization/logging, network-routing/advanced networking (tuning and monitoring).
  • Cluster management and provisioning technologies for bare-metal servers (bonus credit for BCM (Base Command Manager)).
  • Minimum of a four-year degree from an accredited university or college in Computer Science, Electrical or Computer Engineering or equivalent experience.
  • Scripting proficiency (Bash, Python, Ansible, etc.).
  • Experience with schedulers such as SLURM, LSF, UGE, etc.
  • Excellent interpersonal skills and the ability to deliver resolutions for customer issues as they arise.
  • An ability to travel to customer sites within the United States up to 30% of the time.
  • Experience with benchmarking tools such as HPL, NCCL tests, MLPerf as well as Kubernetes experience.
Benefits
  • equity
  • benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Linux system administrationprocess managementpackage managementtask schedulingkernel managementboot proceduresperformance reportingnetwork routingscripting (Bash, Python, Ansible)scheduler experience (SLURM, LSF, UGE)
Soft Skills
interpersonal skillsproblem-solvingcustomer supportdocumentationknowledge transfer
Certifications
Bachelor's degree in Computer ScienceBachelor's degree in Electrical EngineeringBachelor's degree in Computer Engineering