Salary
💰 $184,000 - $356,500 per year
Tech Stack
CloudPythonShell Scripting
About the role
- NVIDIA Aerial CUDA Accelerated RAN (ACAR) framework for building high-performance, software-defined, cloud-native RAN functions over NVIDIA CPU/GPU/DPU based systems.
- Seeking a self-motivated senior performance engineer to drive performance and scalability of the platform.
- Optimizing CPU, GPU and NIC sub-systems for predictable low-latency and maximum efficiency.
- Crafting and implementing performance verification tools, frameworks and dashboards.
- Monitoring and prioritizing performance regressions reported by CI/CD.
- Collaborating with multi-functional teams to solve performance bottlenecks in CPU, GPU and NIC sub-systems.
- Benchmarking performance use-cases on different platforms.
Requirements
- BS/MS (or equivalent experience) and 10+ years’ experience, or PhD with 5+ years’ experience, or equivalent.
- Strong software design, development, debugging and testing skills.
- Hands-on experience with performance analysis, characterization and optimization.
- Experience with programming latency sensitive, real-time, multi-threaded applications on CPUs and one or more of GPUs or DSPs or Vector processors.
- Deep knowledge of CPU, DSP or GPU architecture, as well as memory, I/O and networking interfaces.
- Familiarity with data science and using visualization tools to summarize large quantities of data.
- Experience in one or more programming / scripting languages: C/C++, Python, shell scripting.
- CUDA experience highly desired.
- Experience in designing and managing firmware timelines for wireless SoCs used in cellular wireless networks and/or terminals.
- Track record in E2E design/testing of signal processing algorithms at the PHY layer or resource allocation optimization at MAC level.
- Appetite to learn the details of how next generations of GPU will operate and build an outstanding Software-Radio 5G/6G stack.