Design and create tools and workflows for agent development that support rapid prototyping—define agents, compose toolchains, and construct reasoning loops with minimal overhead.
Build platform solutions to support scalable experimentation, synthetic dataset generation, and multi-agent evaluation across diverse tasks and domains.
Develop feedback and optimization pipelines that incorporate both automated metrics and human-in-the-loop evaluation signals to fine-tune agent behavior.
Implement and scale optimization techniques such as Direct Preference Optimization (DPO), Proximal Policy Optimization (PPO), and reward modeling to improve agent performance.
Launch and support fine-tuned models in production environments with robust evaluation, rollback strategies, and performance monitoring.
Collaborate closely with applied AI/ML teams to translate state-of-the-art research in agentic reasoning, planning, and tool use into reliable, production-ready systems.
Requirements
Strong technical expertise in software development, with understanding of agentic workflows—including reasoning loops, tool invocation, memory, and orchestration of autonomous AI agents.
Hands-on experience using Large Language Models, including prompt engineering, fine-tuning, model distillation, and deploying optimized models (e.g. via DPO, PPO) into production environments.
Proven ability to build and scale ML/AI systems, from experimentation to deployment—owning dataset generation, evaluation pipelines, A/B testing, and performance monitoring.
Leadership and mentorship capabilities, with a track record of guiding complex technical projects and supporting the growth of teammates through code/design reviews and technical direction.
Excellent communication and collaboration skills, with the ability to translate technical ideas into actionable plans and work effectively with cross-functional partners, including product and infrastructure teams.
Innovation mindset and commitment to continuous learning and a bias toward action, staying at the forefront of ML/AI trends, agentic systems research, and best practices in tooling, safety, and evaluation.
Benefits
Market competitive and pay equity-focused compensation structure
In addition to the base pay range listed below, this role is also eligible for bonus opportunities + equity + benefits
100% paid health insurance for employees with 90% coverage for dependents
Annual lifestyle wallet for personal wellness, learning and development, and more!
Lifetime maximum benefit for family forming and fertility benefits
Dedicated mental health support for employees and eligible dependents
Generous time away including company holidays, paid time off, sick time, parental leave, and more!
Lively office environment with catered meals, fully stocked kitchens, and geo-specific commuter benefits
ATS Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.