Nebius Group

System Engineer – Token Factory

Nebius Group

full-time

Posted on:

Location Type: Remote

Location: Netherlands

Visit company website

Explore more

AI Apply
Apply

About the role

  • Develop and optimize low-level kernels and runtime components for AI inference
  • Improve performance of inference engines GPU platforms
  • Profile and debug system-level and hardware-level performance issues
  • Integrate support for new hardware architectures (Hopper, Blackwell, Rubin)
  • Collaborate with ML and backend teams to optimize end-to-end execution

Requirements

  • Strong proficiency in C++ , OR expertise in GPU programming with a focus on low-level high-performance coding and memory management
  • Experience in GPU programming or systems-level software development, e.g. operating system internals, kernel modules, or device drivers
  • Hands-on experience with profiling and debugging tools to identify performance issues on both CPUs and GPUs, and the ability to optimize code based on those findings.
  • Solid understanding of CPU/GPU architecture and memory hierarchy.
Benefits
  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
C++GPU programminglow-level codinghigh-performance codingmemory managementsystems-level software developmentoperating system internalskernel modulesdevice driversprofiling and debugging
Soft skills
collaborationoptimization