Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Autodesk

Research Lead – Principal Scientist, Manager Post-Training, Alignment, Reinforcement Learning

Autodesk

Leading research in post-training alignment and reinforcement learning at Autodesk AI Lab. Managing a team of AI scientists to develop reliable foundation models for various industries.

Posted 5/13/2026full-timeRemote • 🇨🇦 CanadaSeniorWebsite

About the role

Key responsibilities & impact
  • Own post-training strategy for model development — from RLHF and preference optimization to agentic systems and long-horizon reasoning
  • Develop novel algorithms that improve model reliability, controllability, and alignment
  • Make principled architectural decisions about when to address challenges at the pre-training, post-training, or system level
  • Design and run experiments that shape model behavior, robustness, and reasoning quality
  • Partner with infrastructure teams to build scalable, reproducible post-training workflows
  • Contribute to publications, patents, and Autodesk's external research visibility
  • Design evaluation frameworks for long-horizon reasoning, tool use, agentic behavior, safety, and real-world workflow completion
  • Lead rigorous model analysis and interpretability efforts
  • Drive human-in-the-loop evaluation with high annotation quality and sound scientific methodology
  • Establish model readiness criteria and provide go/no-go recommendations for releases
  • Manage, mentor, and grow a team of AI scientists
  • Set technical direction and research priorities across post-training and alignment initiatives
  • Foster a research culture grounded in scientific rigor, reproducibility, and fast iteration
  • Help recruit world-class talent across ML, RL, alignment, and foundation models
  • Partner closely with pre-training teams, infrastructure, product organizations, and other stakeholders
  • Translate research trade-offs into clear, decision-ready guidance for leadership

Requirements

What you’ll need
  • Deep hands-on expertise in reinforcement learning for foundation models, and fluency with post-training methods (RLHF, RLAIF, DPO, PPO, or adjacent approaches)
  • Proven experience leading or mentoring technical research teams — whether in an academic lab, AI research organization, or industry setting
  • Strong intuition for model behavior, alignment challenges, and post-training trade-offs
  • Experience designing evaluation systems and thinking rigorously about what it means for a model to be ready
  • Ability to communicate complex technical trade-offs clearly to both technical and non-technical audiences
  • A PhD or equivalent depth of industry research experience in ML, RL, AI, or a related field

Benefits

Comp & perks
  • health insurance
  • retirement plans
  • paid time off
  • flexible work arrangements
  • professional development
  • bonuses
  • stock options
  • equipment allowances
  • wellness programs

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
reinforcement learningRLHFRLAIFDPOPPOmodel reliabilitymodel controllabilitymodel alignmentevaluation frameworksmodel interpretability
Soft Skills
leadershipmentoringcommunicationdecision-makingscientific rigorteam managementcollaborationresearch cultureproblem-solvingclear guidance
Certifications
PhD in MLPhD in RLPhD in AIequivalent industry research experience