
Human Evaluation Program Manager
Netflix
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Salary
💰 $230,000 - $340,000 per year
About the role
- Lead end-to-end execution of human evaluation and data operations initiatives—from intake and scoping to delivery
- Develop and operationalize frameworks for evaluating GenAI and ML outputs
- Collaborate across research, product, UX, and engineering to embed evaluation into model development cycles
- Build and maintain project timelines, proactively manage blockers, and ensure timely execution
- Develop clear, scalable guidelines and scoring rubrics to ensure consistent rater judgment
- Oversee rater onboarding, calibration, and QA workflows
- Define and monitor success metrics such as speed to IRR, throughput, and task effectiveness
- Pilot and refine evaluation tasks to improve clarity, inter-rater reliability, and feedback quality
- Build foundational documentation and drive adoption of best practices across teams
- Track evaluation health and proactively communicate progress to stakeholders clearly and proactively
- Anticipate and proactively resolve bottlenecks and blockers
- Act as the connective tissue across multiple partners to ensure alignment and effective execution of evaluations at scale
Requirements
- 4+ years of experience working in human evaluations, data collection, labeling, or annotation operations in GenAI/ML environments
- Track record of implementing process improvements or quality control systems for data collection needs
- Prior experience managing human annotation vendors, raters, or data labeling teams
- Strong understanding of evaluation design, including guidelines, rubrics, and scoring protocols
- Proven ability in end-to-end management of complex, cross-functional programs, demonstrating strong Program Management skills and clear accountability for successful delivery
- Experience with human labeling platforms
- Excellent written and verbal communication skills
- Ability to synthesize feedback into clear recommendations and process improvements
- Familiarity with responsible AI principles and how to embed them into evaluation design
- Strong organizational skills and executional focus; ability to track details while seeing the bigger picture.
Benefits
- 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
human evaluationdata operationsevaluation designprocess improvementsquality control systemsscoring protocolscross-functional program managementdata labelingevaluation metricsinter-rater reliability
Soft Skills
communication skillsorganizational skillsexecutional focusproblem-solvingcollaborationsynthesis of feedbackproactive managementaccountabilityadaptabilitystakeholder management