NVIDIA

Senior Software Engineer, Aerial – Performance

full-time

Origin: 🇺🇸 United States • California, North Carolina, Oregon

Salary

💰 $184,000 - $356,500 per year

Job Level

Senior

Tech Stack

Cloud, Python, Shell Scripting

About the role

  • NVIDIA Aerial CUDA Accelerated RAN (ACAR) is a framework for building high-performance, software-defined, cloud-native RAN functions on NVIDIA CPU/GPU/DPU-based systems.
  • Seeking a self-motivated senior performance engineer to drive performance and scalability of the platform.
  • Optimizing CPU, GPU and NIC sub-systems for predictable low latency and maximum efficiency.
  • Crafting and implementing performance verification tools, frameworks and dashboards.
  • Monitoring and prioritizing performance regressions reported by CI/CD.
  • Collaborating with multi-functional teams to solve performance bottlenecks in CPU, GPU and NIC sub-systems.
  • Benchmarking performance use-cases on different platforms.

Requirements

  • BS/MS (or equivalent experience) with 10+ years’ experience, or PhD with 5+ years’ experience.
  • Strong software design, development, debugging and testing skills.
  • Hands-on experience with performance analysis, characterization and optimization.
  • Experience programming latency-sensitive, real-time, multi-threaded applications on CPUs and on one or more of GPUs, DSPs or vector processors.
  • Deep knowledge of CPU, DSP or GPU architecture, as well as memory, I/O and networking interfaces.
  • Familiarity with data science and using visualization tools to summarize large quantities of data.
  • Experience in one or more programming/scripting languages: C/C++, Python, shell scripting.
  • CUDA experience highly desired.
  • Experience in designing and managing firmware timelines for wireless SoCs used in cellular wireless networks and/or terminals.
  • Track record in end-to-end (E2E) design/testing of signal processing algorithms at the PHY layer or resource allocation optimization at the MAC layer.
  • Appetite to learn the details of how next-generation GPUs will operate, and to build an outstanding software-radio 5G/6G stack.