FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
About the role
Key responsibilities & impact- Drive and develop a high-performing team of research scientists.
- Own the team's research roadmap in Reinforcement Learning, balancing near-term product commitments with longer-horizon research bets.
- Define and communicate your team's strategic goals, roadmap, and success metrics in collaboration with key stakeholders.
- Act as the primary technical interface between the Task Adaptation team and adjacent functions.
- Play an active role in identifying, assessing, and recruiting research talent as the team grows.
- Operate with a high degree of autonomy, defining the team's direction and pushing for results in a dynamic environment.
Requirements
What you’ll need- You hold a PhD in Computer Science, Mathematics, Physics, or a comparable quantitative discipline, or possess a strong machine learning background with equivalent research depth.
- You have a strong foundation in deep learning and large-scale model training, preferably with experience in post-training of large language models.
- You have proven experience leading a team of researchers or ML engineers, with a track record of developing talent and maintaining delivery rigour.
- You are comfortable operating across the full model development lifecycle, including understanding infrastructure and compute constraints.
- You have excellent communication skills and the ability to translate complex research direction into clear goals for both technical and non-technical stakeholders.
- You are solution-oriented and decisive, able to navigate ambiguity and drive outcomes without waiting for direction.
Benefits
Comp & perks- Diverse and internationally distributed team: joining our team means becoming part of a large, global community with people of more than 90 nationalities.
- Open communication, regular feedback: we value clear, honest communication, smooth collaboration, direct feedback, and leading with empathy and growth mindset.
- Hybrid work, flexible hours: flexible working hours and trust in your productivity with a hybrid schedule, engaging directly with your team in the office twice a week.
- Virtual Shares: every employee receives Virtual Shares, linking your contribution directly to DeepL’s growth and rewarding you with a stake in our future.
- Regular in-person team events: vibrant events that bring us all together, from local gatherings to company-wide events.
- Monthly full-day hacking sessions: spend your time diving into a project you're passionate about every month during Hack Fridays.
- 30 days of annual leave: access to mental health resources and 30 days off (excluding public holidays).
- Competitive benefits: our benefits package is tailored to align with your unique location, ensuring you feel supported every step of the way.
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Reinforcement Learningdeep learninglarge-scale model trainingpost-training of large language modelsmodel development lifecyclemachine learning
Soft Skills
leadershipcommunicationsolution-orienteddecisiveability to navigate ambiguity
Certifications
PhD in Computer SciencePhD in MathematicsPhD in Physics
