Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
White Circle

Research Scientist/Engineer – Agentic Systems

White Circle

Research Scientist/Engineer designing environments to study AI agents' failures at White Circle. Building complex, large-scale settings for autonomous agents in Paris or London.

Posted 6/29/2026full-timeParis • 🇫🇷 FranceMid-LevelSeniorWebsite

About the role

Key responsibilities & impact
  • Build adversarial environments for agents: complex, uncertain settings that sit on the boundary of agent capability and alignment, where failure is informative rather than trivial.
  • Build realistic multi-agent environments and instrument them so emergent breakdowns are observable — failures that arise from the agents themselves, not ones scripted from the outside.
  • Run experiments end to end, against external APIs and our own models, orchestrating many agents in parallel.
  • Catalogue concrete agent failure modes and build the tooling to surface them at scale.
  • Turn findings into internal models of agent behaviour and into public writeups.

Requirements

What you’ll need
  • Have built at least one non-trivial agent environment or automated research pipeline that ran end to end (single- or multi-agent), and can talk through what broke and why.
  • Strong software and AI engineering. Can independently orchestrate many agents and containers in parallel without that orchestration being the bottleneck.
  • A track record of empirical research in agents, red-teaming, or post-training where you defined the question, ran it, and drew a defensible conclusion.
  • A fast empirical iterator who is comfortable defining the question when there's no playbook: can take a fuzzy concern ("do these agents collude under pressure?") and turn it into a concrete, falsifiable experiment.
  • An AI power-user — fluent with frontier models and coding agents in your daily work.
  • Published research at A* venues on automated red-teaming, agentic environments, or post-training.
  • Experience building monitoring for model failures and anomalous behaviour.
  • Experience reproducing public benchmark results and finding where the original methodology is fragile or misleading.
  • An MSc or PhD in machine learning, computer science, cognitive science, computational neuroscience, physics, or a related quantitative field.
  • AI safety fellowship (MATS, ASTRA, Anthropic Fellows, etc.), or a comparable self-directed research record.

Benefits

Comp & perks
  • Paid time off in line with your local regulations, no matter where you work from
  • Comprehensive medical insurance for our France-based team
  • All the hardware, tools, and services you need
  • Covered subscriptions for AI agents and IDEs
  • Team off-sites twice a year: we’ve recently been to the Alps and to Saint-Tropez

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Agent Environment DevelopmentAutomated Research PipelineOrchestration of AgentsEmpirical Research MethodologyExperimentationAI Model CodingBenchmark ReproductionFailure Mode AnalysisData AnalysisStatistical Modeling
Soft Skills
Problem-SolvingCommunicationCritical ThinkingCollaborationAdaptability
Certifications
AI Safety FellowshipMSc in Machine LearningPhD in Computer SciencePhD in Cognitive SciencePhD in Computational NeurosciencePhD in Physics