
Senior Research Scientist – FMTA
DeepL
full-time
Posted on:
Location Type: Hybrid
Location: Berlin • Germany
Visit company websiteExplore more
Job Level
Tech Stack
About the role
- Design, implement, and deploy cutting-edge research in reinforcement learning and post-training at scale, driving innovations that make it into production.
- Build and deploy state-of-the-art reinforcement learning pipelines at scale.
- Post-train large (multi-modal) models to align them with human intent and enable general capabilities such as reasoning, pushing the boundaries of model performance, safety, and efficiency.
- Always keep the entire lifecycle of research and production in mind: from idea conception, theoretical modeling, prototyping, ablation studies, all the way to production deployment.
- Build and foster external collaborations with academic and industrial partners.
- Follow scientific and technical standards for experimentation, reproducibility, and model evaluation.
- Collaborate deeply with Engineering, ML Platform, and HPC teams to deliver robust and reliable model updates to users.
Requirements
- A solid mathematical background and enjoy solving challenging problems, evidenced by a master's degree, diploma, PhD, or equivalent industry experience in mathematics, physics, computer science, or a related field.
- Deep practical experience in Python and at least one modern machine learning framework such as PyTorch, TensorFlow, or JAX.
- A track record of leading self-directed research projects that go well beyond academic exercises and deliver tangible results.
- Expertise in deep reinforcement learning (RLHF/RLAIF/RLVR) is a plus.
- Hands-on experience scaling and deploying LLMs or other foundation models in real-world systems is a plus.
Benefits
- Diverse and internationally distributed team: joining our team means becoming part of a large, global community with people of more than 90 nationalities. We're more than just colleagues; we're a group of professionals with a shared mission to connect diverse cultures.
- Open communication, regular feedback: as a language-focused company, we value the importance of clear, honest communication.
- Hybrid work, flexible hours: we offer a hybrid work schedule, with team members coming into the office twice a week.
- Monthly full-day hacking sessions: every month, we have Hack Fridays, where you can spend your time diving into a project you're passionate about.
- 30 days of annual leave: we value your peace of mind. With 30 days off (excluding public holidays) and access to mental health resources.
- Competitive benefits: we've crafted it to reflect the diversity of our team.
- Virtual Shares: An ownership mindset in every role.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
reinforcement learningPythondeep reinforcement learningmachine learningmodel evaluationscalingdeploying LLMsprototypingablation studiestheoretical modeling
Soft skills
problem solvingcollaborationleadershipinnovationcommunicationresearch project managementexternal collaborationadaptabilitycritical thinkingcreativity
Certifications
master's degreePhDdiploma