Anthropic

Research Engineer, Environment Scaling

Full-time

Location Type: Hybrid

Location: United States

Salary

💰 $350,000 - $850,000 per year

About the role

  • Improve and execute fine-tuning strategies for adapting Claude to new domains
  • Manage technical relationships with external data vendors
  • Collaborate with domain experts to design data pipelines
  • Explore creating RL environments for high-value tasks
  • Develop and improve QA frameworks to catch reward hacking
  • Partner with RL research and product teams

Requirements

  • Experience with fine-tuning large language models for specific domains or real-world use cases
  • Experience with reinforcement learning, reward design, or training data curation for LLMs
  • Comfortable managing technical vendor relationships
  • Strong project management and interpersonal skills
  • Passionate about making AI more useful and accessible across industries
  • Excited about a role that includes ML research, data operations, and project management

Benefits

  • Competitive compensation
  • Generous vacation
  • Parental leave
  • Flexible working hours
  • Lovely office space
  • Optional equity donation matching

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

  • fine-tuning
  • large language models
  • reinforcement learning
  • reward design
  • training data curation
  • data pipelines
  • QA frameworks
  • reward hacking
  • machine learning research
  • data operations

Soft Skills

  • project management
  • interpersonal skills