Machine Learning Engineer – User Understanding

Zillow

full-time

Posted on: 12/9/2025

Location Type: Remote

Location: Remote • California, Connecticut, District of Columbia, Maryland, Massachusetts, New Jersey, New York, Washington • 🇺🇸 United States

✨ AI Apply

💰 $145,500 - $232,500 per year

Mid-LevelSenior

About the role

Build and maintain data pipelines for LLM (Large Language Model) training and evaluation, curate user-understanding signals (such as intents, preferences, and behavioral features), and ensure data quality, privacy, and proper dataset management.
Develop and manage labeling and feedback loops, including heuristics, annotation jobs, and prompt-based labeling, to create high-quality corpora, collaborating with Data Engineering and Applied Science partners to improve data coverage and reduce noise.
Design, prototype, and ship to production agentic AI solutions, including multi-agent systems using frameworks like LangGraph, and implement context-aware features in partnership with senior engineers.
Implement an evaluation framework to measure model quality on offline test sets (accuracy, bias, safety, user-intent coverage), and build dashboards to track improvements over time.
Lead and contribute to experimentation by implementing metrics, A/B tests, and monitoring, helping to harden prototypes for reliable rollouts.
Collaborate with senior engineers and cross-functional partners to select the right technologies, participate in code reviews, and share best practices (including mentoring interns or new hires as needed).
Summarize research findings and model evaluations into clear write-ups and demos for the team and cross-functional stakeholders.

A master’s degree or above, or equivalent experience in Computer Science, Electrical Engineering, or a related field, with an emphasis on building products using frontier multimodal LLMs (Large Language Models).
Expertise in agentic AI, pretraining, fine-tuning, and reinforcement learning of large language models.
3+ years of hands-on experience building large-scale, high-impact solutions, ideally with recent experience in agent-based systems, multi-agent collaboration, or similar paradigms.
Experience deploying and scaling AI services capable of handling hundreds of millions of daily interactions with high availability, low latency, and robust fault tolerance.
A track record of writing articles and publishing high-impact research in top AI conferences is a big plus.

Benefits

Tip: use these terms in your resume and cover letter to boost ATS matches.

data pipelinesLLMlarge language modelsagentic AIpretrainingfine-tuningreinforcement learningA/B testingdata qualitydataset management

collaborationmentoringcommunicationleadershipexperimentation

master's degreeComputer ScienceElectrical Engineering