Zillow

Machine Learning Engineer – User Understanding

Zillow

full-time

Posted on:

Location Type: Remote

Location: Remote • California, Connecticut, District of Columbia, Maryland, Massachusetts, New Jersey, New York, Washington • 🇺🇸 United States

Visit company website
AI Apply
Apply

Salary

💰 $145,500 - $232,500 per year

Job Level

Mid-LevelSenior

About the role

  • Build and maintain data pipelines for LLM (Large Language Model) training and evaluation, curate user-understanding signals (such as intents, preferences, and behavioral features), and ensure data quality, privacy, and proper dataset management.
  • Develop and manage labeling and feedback loops, including heuristics, annotation jobs, and prompt-based labeling, to create high-quality corpora, collaborating with Data Engineering and Applied Science partners to improve data coverage and reduce noise.
  • Design, prototype, and ship to production agentic AI solutions, including multi-agent systems using frameworks like LangGraph, and implement context-aware features in partnership with senior engineers.
  • Implement an evaluation framework to measure model quality on offline test sets (accuracy, bias, safety, user-intent coverage), and build dashboards to track improvements over time.
  • Lead and contribute to experimentation by implementing metrics, A/B tests, and monitoring, helping to harden prototypes for reliable rollouts.
  • Collaborate with senior engineers and cross-functional partners to select the right technologies, participate in code reviews, and share best practices (including mentoring interns or new hires as needed).
  • Summarize research findings and model evaluations into clear write-ups and demos for the team and cross-functional stakeholders.

Requirements

  • A master’s degree or above, or equivalent experience in Computer Science, Electrical Engineering, or a related field, with an emphasis on building products using frontier multimodal LLMs (Large Language Models).
  • Expertise in agentic AI, pretraining, fine-tuning, and reinforcement learning of large language models.
  • 3+ years of hands-on experience building large-scale, high-impact solutions, ideally with recent experience in agent-based systems, multi-agent collaboration, or similar paradigms.
  • Experience deploying and scaling AI services capable of handling hundreds of millions of daily interactions with high availability, low latency, and robust fault tolerance.
  • A track record of writing articles and publishing high-impact research in top AI conferences is a big plus.
Benefits
  • competitive base salary
  • equity awards based on experience, performance, and location

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
data pipelinesLLMlarge language modelsagentic AIpretrainingfine-tuningreinforcement learningA/B testingdata qualitydataset management
Soft skills
collaborationmentoringcommunicationleadershipexperimentation
Certifications
master's degreeComputer ScienceElectrical Engineering