LMArena

Machine Learning Scientist - Open Source Lead

LMArena

full-time

Posted on:

Location: California • 🇺🇸 United States

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

PythonPyTorchTensorflow

About the role

  • Lead open-source ML research including open data set and code releases to advance AI model evaluation in the open.
  • Design, run, and share new methods and experiments revealing what makes models useful, trustworthy, and capable, grounded in human preference signals and released openly for the ecosystem and research community to build upon.
  • Take commitment to openness from principle to practice; curate high-impact datasets; develop new methodology and reproducible benchmarks; release code enabling the research ecosystem to push AI evaluations forward.
  • Shape the public leaderboard, power community tools, and strengthen transparency in AI evaluation worldwide.
  • Work interdisciplinary with engineers, product teams, marketing, and the broader research community to advance how we compare models, analyze preference data, and understand factors like style, reasoning, and robustness.
  • Collaborate with GTM teams as spokesperson for outreach for our open research efforts: strengthening partnerships, expanding research community participation, and championing programs that grow and support our research network.
  • If you’re excited by open-ended questions, rigorous evaluation, and scientific communication and outreach, you’ll find a meaningful home here.

Requirements

  • PhD or equivalent research experience in Machine Learning, Natural Language Processing, Statistics, or a related field
  • Uses personal and professional platforms to amplify open research initiatives and invite collaboration.
  • Strong understanding of LLMs and modern deep learning architectures (e.g., Transformers, diffusion models, reinforcement learning with human feedback)
  • Proficiency in Python and ML research libraries such as PyTorch, JAX, or TensorFlow
  • Demonstrated ability to design and analyze experiments with statistical rigor
  • Experience publishing research or working on open-source projects in ML, NLP, or AI evaluation
  • Comfortable working with real-world usage data and designing metrics beyond standard benchmarks
  • Ability to translate research questions into practical systems and collaborate across engineering and product teams
  • Passion for open science, reproducibility, and community-driven research
PwC

Senior Associate, AI Engineer

PwC
Seniorfull-time$77k–$202k / yearDistrict of Columbia, Florida, Illinois, New York, Texas · 🇺🇸 United States
Posted: 2 hours agoSource: pwc.wd3.myworkdayjobs.com
CloudDockerKubernetesMicroservices
General Dynamics Information Technology

Associate AI/ML Engineer – Active Secret Clearance Preferred

General Dynamics Information Technology
Junior · Midfull-time$65k–$86k / year🇺🇸 United States
Posted: 8 hours agoSource: gdit.wd5.myworkdayjobs.com
AWSAzureCloudGoogle Cloud PlatformJavaJavaScriptPythonSQL
Clariti

Senior MLOps Engineer

Clariti
Seniorfull-time$190k–$240k / year🇺🇸 United States
Posted: 1 day agoSource: boards.greenhouse.io
AWSAzureCloudDockerGoogle Cloud PlatformJenkinsKubernetesPython
General Motors

Senior AI/ML Engineer – Build Platform

General Motors
Seniorfull-time$170k–$240k / yearCalifornia · 🇺🇸 United States
Posted: 1 day agoSource: generalmotors.wd5.myworkdayjobs.com
DockerPython
CACI International Inc

AI Developer

CACI International Inc
Mid · Seniorfull-time$82k–$172k / yearVirginia · 🇺🇸 United States
Posted: 1 day agoSource: caci.wd1.myworkdayjobs.com
AWSEC2FlaskLinuxPythonSQLUnix