AI Research Scientist

• Lead high-impact research on data quality frameworks for post-training LLMs — including techniques for preference consistency, label reliability, annotator calibration, and dataset auditing.
• Design and implement systems for identifying noisy, low-value, or adversarial data points in human feedback and synthetic comparison datasets.
• Drive strategy for aligning data collection, curation, and filtering with post-training objectives such as helpfulness, harmlessness, and faithfulness.
• Collaborate cross-functionally with engineers, alignment researchers, and product leaders to translate research into production-ready pipelines for RLHF and DPO.
• Mentor and influence junior researchers and engineers working on data-centric evaluation, reward modeling, and benchmark creation.
• Author foundational tools and metrics that connect supervision data characteristics to downstream LLM behavior and evaluation performance.
• Publish and present research that advances the field of data quality in LLM post-training, contributing to academic and industry best practices.

Staff AI Research Scientist, Data Quality

Staff AI Research Scientist – Evaluation