
Research Engineer, Environment Scaling
Anthropic
full-time
Posted on:
Location Type: Hybrid
Location: United States
Visit company websiteExplore more
Salary
💰 $350,000 - $850,000 per year
About the role
- Improve and execute fine-tuning strategies for adapting Claude to new domains
- Manage technical relationships with external data vendors
- Collaborate with domain experts to design data pipelines
- Explore creating RL environments for high value tasks
- Develop and improve QA frameworks to catch reward hacking
- Partner with RL research and product teams
Requirements
- Experience with fine-tuning large language models for specific domains or real-world use cases
- Experience with reinforcement learning, reward design, or training data curation for LLMs
- Comfortable managing technical vendor relationships
- Strong project management and interpersonal skills
- Passionate about making AI more useful and accessible across industries
- Excited about a role that includes ML research, data operations, and project management
Benefits
- Competitive compensation
- Generous vacation
- Parental leave
- Flexible working hours
- Lovely office space
- Optional equity donation matching
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
fine-tuninglarge language modelsreinforcement learningreward designtraining data curationdata pipelinesQA frameworksreward hackingmachine learning researchdata operations
Soft Skills
project managementinterpersonal skills