Nebius Group

Senior ML Solutions Architect – Token Factory

Nebius Group

full-time

Posted on:

Location Type: Remote

Location: Netherlands

Visit company website

Explore more

AI Apply
Apply

Job Level

Tech Stack

About the role

  • Design and implement LLM-based solutions using Nebius Token Factory’s inference services to drive business value and support customer goals.
  • Build production-ready applications leveraging our serverless LLM APIs, including multimodal models (text, vision, audio) and domain-specific models.
  • Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to surface customer feedback and shape the platform roadmap.
  • Guide customers in scaling from POC to production with a focus on performance, reliability, and cost efficiency.

Requirements

  • 5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI.
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
  • Hands-on experience with:
  • Prompt engineering and LLM pipeline development, including evaluation.
  • Agentic frameworks such as Langchain, Langsmith, smolagents, or equivalent.
  • Vector databases and RAG implementation patterns.
  • Deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models.
  • Strong Python programming skills.
  • Excellent communication skills, with the ability to clearly explain technical concepts to diverse audiences.
Benefits
  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
ML systemsAI systemsLLMsgenerative AIprompt engineeringLLM pipeline developmentPython programmingmodel architecturesfine-tuning approachesinference optimization
Soft Skills
communication skills