DeepL

Senior Research Scientist – FMTA

full-time

Location Type: Hybrid

Location: Berlin, Germany

About the role

  • Design, implement, and deploy cutting-edge research in reinforcement learning and post-training at scale, driving innovations that make it into production.
  • Build and deploy state-of-the-art reinforcement learning pipelines at scale.
  • Post-train large (multi-modal) models to align them with human intent and enable general capabilities such as reasoning, pushing the boundaries of model performance, safety, and efficiency.
  • Always keep the entire lifecycle of research and production in mind: from idea conception through theoretical modeling, prototyping, and ablation studies, all the way to production deployment.
  • Build and foster external collaborations with academic and industrial partners.
  • Follow scientific and technical standards for experimentation, reproducibility, and model evaluation.
  • Collaborate deeply with Engineering, ML Platform, and HPC teams to deliver robust and reliable model updates to users.

Requirements

  • A solid mathematical background and an enjoyment of solving challenging problems, evidenced by a master's degree, diploma, PhD, or equivalent industry experience in mathematics, physics, computer science, or a related field.
  • Deep practical experience in Python and at least one modern machine learning framework such as PyTorch, TensorFlow, or JAX.
  • A track record of leading self-directed research projects that go well beyond academic exercises and deliver tangible results.
  • Expertise in deep reinforcement learning (RLHF/RLAIF/RLVR) is a plus.
  • Hands-on experience scaling and deploying LLMs or other foundation models in real-world systems is a plus.

Benefits

  • Diverse and internationally distributed team: joining our team means becoming part of a large, global community with people of more than 90 nationalities. We're more than just colleagues; we're a group of professionals with a shared mission to connect diverse cultures.
  • Open communication, regular feedback: as a language-focused company, we value the importance of clear, honest communication.
  • Hybrid work, flexible hours: we offer a hybrid work schedule, with team members coming into the office twice a week.
  • Monthly full-day hacking sessions: every month, we have Hack Fridays, where you can spend your time diving into a project you're passionate about.
  • 30 days of annual leave: we value your peace of mind, so we offer 30 days off (excluding public holidays) and access to mental health resources.
  • Competitive benefits: we've crafted our benefits to reflect the diversity of our team.
  • Virtual Shares: An ownership mindset in every role.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
reinforcement learning, Python, deep reinforcement learning, machine learning, model evaluation, scaling, deploying LLMs, prototyping, ablation studies, theoretical modeling
Soft skills
problem solving, collaboration, leadership, innovation, communication, research project management, external collaboration, adaptability, critical thinking, creativity
Certifications
master's degree, PhD, diploma