
Senior Solutions Architect – AI Compute
NVIDIA
full-time
Posted on:
Location Type: Remote
Location: District of Columbia • Florida • United States
Visit company websiteExplore more
Salary
💰 $184,000 - $287,500 per year
Job Level
Tech Stack
About the role
- Primary responsibilities will include deploying, managing, and validating AI Compute/HPC infrastructure in Linux-based environments for new and existing customers.
- Be the domain expert with partners during planning calls through implementation.
- Handover-related documentation and perform knowledge transfers required to support customers and partners as they begin rolling out some of the most sophisticated systems in the world!
- Provide feedback to internal and partners teams such as opening bugs, documenting workarounds, and suggesting improvements.
Requirements
- 8+ years providing in-depth support and deployment services; solving problems for hardware and software products.
- Knowledge and experience with Linux system administration, process and package management, task scheduling, kernel management, boot procedures/fixing, performance reporting/optimization/logging, network-routing/advanced networking (tuning and monitoring).
- Cluster management and provisioning technologies for bare-metal servers (bonus credit for BCM (Base Command Manager)).
- Minimum of a four-year degree from an accredited university or college in Computer Science, Electrical or Computer Engineering or equivalent experience.
- Scripting proficiency (Bash, Python, Ansible, etc.).
- Experience with schedulers such as SLURM, LSF, UGE, etc.
- Excellent interpersonal skills and the ability to deliver resolutions for customer issues as they arise.
- An ability to travel to customer sites within the United States up to 30% of the time.
- Experience with benchmarking tools such as HPL, NCCL tests, MLPerf as well as Kubernetes experience.
Benefits
- equity
- benefits 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Linux system administrationprocess managementpackage managementtask schedulingkernel managementboot proceduresperformance reportingnetwork routingscripting (Bash, Python, Ansible)scheduler experience (SLURM, LSF, UGE)
Soft Skills
interpersonal skillsproblem-solvingcustomer supportdocumentationknowledge transfer
Certifications
Bachelor's degree in Computer ScienceBachelor's degree in Electrical EngineeringBachelor's degree in Computer Engineering