
Senior AI Network – Compute Consultant
Metsi Technologies
full-time
Location Type: Remote
Location: Poland
About the role
- Collaborate with Dell consultants and client stakeholders to design and implement sophisticated HPC/AI cluster and network architectures.
- Oversee the deployment of high-performance compute environments, ensuring both technical excellence and customer satisfaction throughout each phase of the engagement.
- Design and architect scalable, high-performance compute and network infrastructures for HPC/AI clusters.
- Lead the implementation of advanced networking solutions, including NVIDIA InfiniBand and Ethernet technologies.
- Deploy and manage orchestration tools such as NVIDIA Base Command Manager for cluster management and monitoring.
- Provide expert consulting on compute and network infrastructure strategy, planning, and execution.
- Collaborate with clients to assess technical requirements and deliver customized solutions.
- Troubleshoot and resolve performance bottlenecks across compute, storage, and network layers.
- Develop comprehensive documentation, including architecture diagrams, deployment guides, and operational procedures.
Requirements
- Proven success in designing and deploying large-scale HPC/AI clusters (NVIDIA, AMD, Intel)
- Demonstrated expertise in NVIDIA networking technologies: InfiniBand (Quantum), Ethernet (Spectrum-X), MLNX-OS, NVIDIA Cumulus Linux, and Enterprise SONiC
- Proficient in Linux systems administration and scripting
- Extensive hands-on experience with Base Command Manager or equivalent orchestration tools
- Experience in consulting roles, with strong communication and documentation abilities
- Capacity to manage multiple projects independently and deliver results within dynamic environments
- Certifications in networking and Linux (e.g., CCNP, LFCS, NCP-AIN, NCP-AIO)
- Experience with NVIDIA DGX systems or similar GPU platforms
- Familiarity with container and workload orchestration technologies (e.g., Kubernetes, Docker, Slurm)
- Knowledge of data centre operations and cloud integration methods
- Experience with GenAI frameworks and related tools
Benefits
- Dell Technologies is committed to empowering our customers with innovative products and services that enhance their performance and productivity.
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development opportunities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
HPC cluster design, AI cluster deployment, NVIDIA InfiniBand, Ethernet technologies, Linux systems administration, scripting, orchestration tools, Base Command Manager, container orchestration, performance troubleshooting
Soft Skills
communication, documentation, project management, collaboration, customer satisfaction, consulting, problem-solving, independent work, adaptability, strategic planning
Certifications
CCNP, LFCS, NCP-AIN, NCP-AIO