
HPC Technical Lead
General Dynamics Information Technology
full-time
Location Type: Remote
Location: United States
Salary
💰 $182,750 - $247,250 per year
About the role
- Serve as the primary technical authority for all HPC operational matters.
- Lead and drive root-cause analysis and problem resolution for critical incidents across compute, storage, and interconnect components.
- Monitor, analyze, and optimize performance across compute nodes, interconnects, and storage subsystems.
- Conduct proactive health checks and performance tuning to ensure system readiness for 24×7 mission-critical NOAA workloads.
- Lead technical planning and readiness reviews for all system upgrades, patches, and enhancements.
- Maintain configuration baselines and ensure compliance with security requirements (e.g., RMF/STIG).
- Act as the primary technical point of contact for NWS/NOAA for detailed HPC discussions, system behavior, and operational issues.
- Coordinate with NOAA scientific teams to understand modeling workload needs and optimize HPC resources accordingly.
Requirements
- 10+ years of hands-on HPC systems administration and troubleshooting experience, including Cray, SGI, or comparable large-scale systems.
- Extensive experience supporting Federal HPC environments, demonstrating readiness for NOAA/NWS operational environments.
- Deep Linux expertise, including SLES, RHEL, and CentOS across multiple HPC platforms.
- Strong technical experience with HPC storage (e.g., Lustre), interconnects (e.g., InfiniBand), and performance tuning of large-scale computing systems.
- Proven leadership of HPC technical teams, including mentoring and directing system administrators and engineers supporting very large core-count systems.
- Demonstrated success performing root cause analysis, escalated troubleshooting, and incident recovery in production HPC environments.
- Experience implementing STIG/RMF security controls across HPC systems and applying DoD-grade configuration compliance.
- Excellent communication skills, capable of translating complex technical issues to customers and stakeholders.
Benefits
- Our benefits package for all US-based employees includes a variety of medical plan options (some with Health Savings Accounts), dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre- and post-tax dollars up to the IRS annual limits and receive a company match.
- To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, and paid parental, military, bereavement, and jury duty leave.
- GDIT typically provides new employees with 15 days of paid leave per calendar year to be used for vacations, personal business, and illness, plus an additional 10 paid holidays per year.
- Paid leave and paid holidays are prorated based on the employee’s date of hire.
- The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12-month period for eligible employees.
- To help employees protect their income, other offerings are provided or available, including short- and long-term disability benefits; life, accidental death and dismemberment, personal accident, and critical illness insurance; and business travel and accident insurance.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
HPC systems administration, troubleshooting, Linux, SLES, RHEL, CentOS, HPC storage, Lustre, InfiniBand, performance tuning
Soft Skills
leadership, mentoring, communication, problem resolution, technical planning, collaboration