DoubleVerify

Staff AI Data Engineer

DoubleVerify

full-time

Posted on:

Location Type: Remote

Location: Remote • 🇬🇧 United Kingdom

Visit company website
AI Apply
Apply

Job Level

Lead

Tech Stack

Distributed Systems

About the role

  • Apply bleeding edge AI theory to the design and implementation of large-scale data systems that feed AI agents and autonomous workflows.
  • Use data science techniques to fine-tune, evaluate, and optimize LLMs for marketing-specific tasks: attribution insights, anomaly detection, summarization, classification, and automated recommendations.
  • Build end-to-end automations using LLMs, internal data, and external signals to eliminate repetitive human tasks.
  • Build AI-driven automations that reduce manual work across Rockerbox and unlock new client-facing capabilities.
  • Design retrieval, orchestration, and memory layers that make our AI agents smarter over time.
  • Establish best practices for AI data quality, observability, experiments, and safety.
  • Lead R&D initiatives: rapid prototyping, experimentation, model evaluations, and productionization.
  • Mentor data scientists and engineers across organization to raise the bar on LLM use company-wide.

Requirements

  • 8+ years of experience in data engineering, AI, ML platforms, or large-scale distributed systems.
  • Hands-on experience integrating LLMs into production systems (OpenAI, fine-tuning, embeddings, RAG, vector stores, or custom agent orchestration).
  • Strong understanding of experimentation, model evaluation, and performance tuning.
  • You think in systems: storage, retrieval, metadata, reliability, latency, failure modes.
  • Ability to work from ambiguity to execution — you’re comfortable being the first to figure something out.
  • Strong communication skills: you can explain tradeoffs, scope decisions, and technical strategy clearly.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
data engineeringAIML platformslarge-scale distributed systemsLLMsfine-tuningembeddingsRAGvector storescustom agent orchestration
Soft skills
strong communication skillsability to work from ambiguity to executionmentoringleadership