
System Engineer – Token Factory
Nebius Group
full-time
Location Type: Remote
Location: Netherlands
About the role
- Develop and optimize low-level kernels and runtime components for AI inference
- Improve the performance of inference engines on GPU platforms
- Profile and debug system-level and hardware-level performance issues
- Integrate support for new hardware architectures (Hopper, Blackwell, Rubin)
- Collaborate with ML and backend teams to optimize end-to-end execution
Requirements
- Strong proficiency in C++, or expertise in GPU programming, with a focus on low-level, high-performance coding and memory management
- Experience in GPU programming or systems-level software development, e.g. operating system internals, kernel modules, or device drivers
- Hands-on experience with profiling and debugging tools to identify performance issues on both CPUs and GPUs, and the ability to optimize code based on those findings.
- Solid understanding of CPU/GPU architecture and memory hierarchy.
Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
Applicant Tracking System Keywords
Hard skills
C++, GPU programming, low-level coding, high-performance coding, memory management, systems-level software development, operating system internals, kernel modules, device drivers, profiling and debugging
Soft skills
collaboration, optimization