Salary
💰 $295,000 - $530,000 per year
About the role
- Conduct research on model personality and behavior, using and developing tools such as synthetic data, reinforcement learning, and reasoning to shape the personality and behavior of models (e.g., o3, o4-mini, 4o)
- Build evaluations and pipelines to facilitate development and research
- Develop new post-training methods
- Integrate research into OpenAI products used by hundreds of millions of users
- Collaborate with the Personality & Model Behavior and Post-training teams to apply research findings
- Dive into large ML codebases to implement and debug research systems
Requirements
- Expertise in reinforcement learning, machine learning, and natural language processing
- Deep understanding of machine learning and its applications
- Prior experience training and optimizing models and building evaluations
- Experience with synthetic data, reinforcement learning, and reasoning approaches
- Ability to dive into large ML codebases to debug issues
- Track record of delivering innovative, out-of-the-box solutions to address real-world constraints
- Ability to thrive in dynamic and technically complex environments
- Passion for tackling open-ended research challenges and integrating research into products
- Willingness and ability to work from a US office (the application asks whether you can work from a US office three days per week)