Anthropic

Research Engineer, Audio

full-time

Location Type: Hybrid

Location: San Francisco, California, New York, United States

Salary

💰 $350,000 - $500,000 per year

About the role

  • Work across the full stack of audio ML, developing audio codecs and representations
  • Source and synthesize high quality audio data
  • Train large-scale speech language models and large audio diffusion models
  • Develop novel architectures for incorporating continuous signals into LLMs
  • Build advanced, steerable end-to-end conversational systems
  • Collaborate with many teams across pretraining, finetuning, reinforcement learning, production inference, and product

Requirements

  • Have hands-on experience with training audio models, whether that's conversational speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, codecs, or generative audio models
  • Genuinely enjoy both research and engineering work, and you'd describe your ideal split as roughly 50/50 rather than heavily weighted toward one or the other
  • Are comfortable working across abstraction levels, from signal processing fundamentals to large-scale model training and inference optimization
  • Have deep expertise with JAX, PyTorch, or large-scale distributed training, and can debug performance issues across the full stack
  • Thrive in fast-moving environments where the most important problem might shift as we learn more about what works
  • Communicate clearly and collaborate effectively; audio touches many parts of our systems, so you'll work closely with teams across the company
  • Are passionate about building conversational AI that feels natural, steerable, and safe
  • Care about the societal impacts of voice AI and want to help shape how these systems are developed responsibly

Benefits

  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Lovely office space for collaboration

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

audio ML, audio codecs, speech language models, audio diffusion models, conversational speech-to-speech, speech translation, speech recognition, text-to-speech, diarization, generative audio models

Soft Skills

collaboration, communication, adaptability, problem-solving, passion for AI, responsibility in development