Metsi Technologies

Senior AI Network – Compute Consultant

Metsi Technologies

full-time

Posted on:

Location Type: Remote

Location: Poland

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Collaborate with Dell consultants and client stakeholders to design and implement sophisticated HPC/AI cluster and network architectures.
  • Oversee the deployment of high-performance compute environments, ensuring both technical excellence and customer satisfaction throughout each phase of the engagement.
  • Design and architect scalable, high-performance compute and network infrastructures for HPC/AI clusters.
  • Lead the implementation of advanced networking solutions, including NVIDIA InfiniBand and Ethernet technologies.
  • Deploy and manage orchestration tools such as NVIDIA Base Command Manager for cluster management and monitoring.
  • Provide expert consulting on compute and network infrastructure strategy, planning, and execution and collaborate with clients to assess technical requirements and deliver customized solutions.
  • Troubleshoot and resolve performance bottlenecks across compute, storage, and network layers and develop comprehensive documentation, including architecture diagrams, deployment guides, and operational procedures.

Requirements

  • Proven success in designing and deploying large-scale HPC/AI clusters (NVIDIA, AMD, Intel)
  • Demonstrated expertise in NVIDIA networking technologies: InfiniBand (Quantum), Ethernet (Spectrum-X), MLNX-OS, NVIDIA Cumulus OS, and Enterprise SONiC
  • Proficient in Linux systems administration and scripting
  • Extensive hands-on experience with Base Command Manager or equivalent orchestration tools
  • Experience in consulting roles with strong communication and documentation abilities and capacity to manage multiple projects independently and deliver results within dynamic environments
  • Certifications in networking and Linux (e.g., CCNP, LFCS, NCP-AIN, NCP-AIO), experience with NVIDIA DGX systems or similar GPU platforms and familiarity with container orchestration technologies (e.g., Kubernetes, Docker, Slurm)
  • Knowledge of data centre operations and cloud integration methods and experience with GENAI frameworks and related tools
Benefits
  • Dell Technologies is committed to empowering our customers with innovative products and services that enhance their performance and productivity.
  • Health insurance
  • Retirement plans
  • Paid time off
  • Flexible work arrangements
  • Professional development opportunities
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
HPC cluster designAI cluster deploymentNVIDIA InfiniBandEthernet technologiesLinux systems administrationscriptingorchestration toolsBase Command Managercontainer orchestrationperformance troubleshooting
Soft Skills
communicationdocumentationproject managementcollaborationcustomer satisfactionconsultingproblem-solvingindependent workadaptabilitystrategic planning
Certifications
CCNPLFCSNCP-AINNCP-AIO