
Principal Software Engineer – Scale-Up Networking, GPU-Centric
Hewlett Packard Enterprise
full-time
Posted on:
Location Type: Hybrid
Location: Bangalore • India
Visit company websiteExplore more
Job Level
About the role
- Architect & Deliver Scale-Up Networking
- Design and implement GPU-aware networking paths for high-bandwidth, low-latency intra-node communication
- Develop and optimize GPU → NIC → GPU data movement, shared memory models, and DMA pathways
- Work with NVIDIA CUDA, NVLink, NCCL, and AMD ROCm, InfinityFabric, RCCL teams to integrate and optimize scale-up communication semantics
- Drive improvements to DMA engines, BAR mappings, ATS/IOMMU, and GPU memory registration workflows
- Enhance and extend Libfabric, UCX, CXI, SHMEMX, OpenMPI for GPU-accelerated scale-up workflows
- Optimize communication collectives, transport layers, and GPU-direct capabilities
- Characterize and tune multi-NIC per socket, NUMA-zone mapping, GPU locality, CQ/queue design, and CPU/GPU topology optimization
- Lead upstream contributions to open-source projects (OFI, UCX, OpenMPI, RCCL/NCCL enablement)
- Partner with HPC/AI ecosystem teams to shape future architectures
- Own complex debugging across driver, runtime, GPU, kernel, and user-space boundaries
- Develop profiling workflows using Nsight, ROCm tools, eBPF, perf, etc.
Requirements
- 10–15+ years building high-performance networking, GPU, or kernel-level software
- Deep expertise in C/C++, Linux internals, memory management, RDMA, PCIe, IOMMU, ATS, DMA engines
- Strong understanding of CUDA, ROCm, GPU memory models, P2P, GDS (GPUDirect Storage), GDR (GPUDirect RDMA)
- Hands-on experience with MPI, SHMEM, Libfabric, UCX, or similar communication stacks
- Proven experience driving architecture, cross-org technical decisions, and upstream contributions
- Ability to mentor senior engineers, influence multi-team designs, and own end-to-end delivery.
Benefits
- Health & Wellbeing
- Personal & Professional Development
- Unconditional Inclusion
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
C/C++Linux internalsmemory managementRDMAPCIeIOMMUDMA enginesCUDAROCmMPI
Soft Skills
mentoringinfluencingtechnical decision-makingend-to-end delivery