Datadog

Staff Applied Scientist – Observability Data Platform

Datadog

full-time

Posted on:

Origin:  • 🇺🇸 United States • California, Colorado, Massachusetts, New York, Washington

Visit company website
AI Apply
Apply

Salary

💰 $234,000 - $300,000 per year

Job Level

Lead

Tech Stack

Distributed SystemsGoPythonRust

About the role

  • Design and prototype intelligent systems for AI-native observability, including cost-aware agent orchestration, adaptive query execution, and self-optimizing system components
  • Lead efforts to apply reinforcement learning, search, or hybrid approaches to infrastructure-level decision-making (autoscaling, scheduling, load shaping)
  • Collaborate with AI researchers and platform engineers to design experimentation loops and verifiers guiding LLM outputs using runtime metrics and formal models
  • Explore emerging paradigms like AI compilers, “programming after code,” and runtime-aware prompt engineering to inform infrastructure and product design
  • Help define the direction of BitsEvolve — Datadog’s optimization agent that uses LLMs and evolutionary search
  • Partner with product teams and platform stakeholders to translate scientific advances into measurable improvements in cost, performance, and observability depth

Requirements

  • BS/MS/PhD in a scientific field or equivalent experience
  • 8+ years of experience in systems engineering, database internals, or infrastructure research, including hands-on experience in a production environment
  • Strong software engineering foundation, ideally in C++, Rust, Go, or Python
  • Deep expertise in at least one of: query optimization, data center scheduling, compiler design, reinforcement learning, or distributed systems design
  • Experience applying search, planning, or learning techniques to solve real-world optimization problems
  • Experience applying reinforcement learning, search, or hybrid approaches to infrastructure-level decision-making
  • Hypothesis-driven: experience designing experiments, simulations, benchmarks, or live-system evaluation loops
  • Comfortable reading research papers and building prototypes
  • Ability to collaborate across research, engineering, and product teams
x.ai

Systems Engineer – Rust, C++, Sandbox Service

x.ai
Mid · Seniorfull-time$180k–$440k / yearCalifornia · 🇺🇸 United States
Posted: 3 hours agoSource: boards.greenhouse.io
Distributed SystemsGoLinuxPythonRust
NVIDIA

Senior Software Engineer - Distributed Inference

NVIDIA
Seniorfull-time$184k–$357k / yearArizona, Colorado · 🇺🇸 United States
Posted: 29 days agoSource: nvidia.wd5.myworkdayjobs.com
AWSAzureCloudDistributed SystemsGoogle Cloud PlatformKubernetesMicroservicesPythonRust
Keycard Labs

Staff Data Engineer

Keycard Labs
Leadfull-time🇨🇦 Canada
Posted: 19 days agoSource: jobs.ashbyhq.com
Distributed SystemsGoHerokuKafkaOpen SourcePythonRust
TIDB

Senior Software Engineer - Data Replication

TIDB
Seniorfull-time🇺🇸 United States
Posted: 26 days agoSource: boards.greenhouse.io
CloudDistributed SystemsGoJavaLinuxRustSQLUnix
Improbable

Senior Systems Engineer

Improbable
Seniorfull-time🇬🇧 United Kingdom
Posted: 20 days agoSource: jobs.ashbyhq.com
Distributed SystemsRustWeb3