Senior Research Scientist – FMTA

DeepL

full-time

Posted on: 1/8/2026

Location Type: Hybrid

Location: Berlin • Germany

Visit company website

Explore more

Research Scientist jobs

✨ AI Apply

Apply

Job Level

Senior

Tech Stack

Python PyTorch Tensorflow

About the role

Design, implement, and deploy cutting-edge research in reinforcement learning and post-training at scale, driving innovations that make it into production.
Build and deploy state-of-the-art reinforcement learning pipelines at scale.
Post-train large (multi-modal) models to align them with human intent and enable general capabilities such as reasoning, pushing the boundaries of model performance, safety, and efficiency.
Always keep the entire lifecycle of research and production in mind: from idea conception, theoretical modeling, prototyping, ablation studies, all the way to production deployment.
Build and foster external collaborations with academic and industrial partners.
Follow scientific and technical standards for experimentation, reproducibility, and model evaluation.
Collaborate deeply with Engineering, ML Platform, and HPC teams to deliver robust and reliable model updates to users.

Requirements

A solid mathematical background and enjoy solving challenging problems, evidenced by a master's degree, diploma, PhD, or equivalent industry experience in mathematics, physics, computer science, or a related field.
Deep practical experience in Python and at least one modern machine learning framework such as PyTorch, TensorFlow, or JAX.
A track record of leading self-directed research projects that go well beyond academic exercises and deliver tangible results.
Expertise in deep reinforcement learning (RLHF/RLAIF/RLVR) is a plus.
Hands-on experience scaling and deploying LLMs or other foundation models in real-world systems is a plus.

Benefits

Diverse and internationally distributed team: joining our team means becoming part of a large, global community with people of more than 90 nationalities. We're more than just colleagues; we're a group of professionals with a shared mission to connect diverse cultures.
Open communication, regular feedback: as a language-focused company, we value the importance of clear, honest communication.
Hybrid work, flexible hours: we offer a hybrid work schedule, with team members coming into the office twice a week.
Monthly full-day hacking sessions: every month, we have Hack Fridays, where you can spend your time diving into a project you're passionate about.
30 days of annual leave: we value your peace of mind. With 30 days off (excluding public holidays) and access to mental health resources.
Competitive benefits: we've crafted it to reflect the diversity of our team.
Virtual Shares: An ownership mindset in every role.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills

reinforcement learningPythondeep reinforcement learningmachine learningmodel evaluationscalingdeploying LLMsprototypingablation studiestheoretical modeling

Soft skills

problem solvingcollaborationleadershipinnovationcommunicationresearch project managementexternal collaborationadaptabilitycritical thinkingcreativity

Certifications

master's degreePhDdiploma