
Research Staff, Voice AI Foundations
Deepgram
full-time
Posted on:
Location Type: Remote
Location: California • Missouri • United States
Visit company websiteExplore more
Salary
💰 $150,000 - $250,000 per year
Job Level
About the role
- Pioneer the development of Latent Space Models (LSMs)
- Build next-generation neural audio codecs
- Pioneer steerable generative models
- Develop embedding systems for codec latent space
- Leverage latent recombination for synthetic audio data
- Train multimodal speech-to-speech systems
- Design model architectures, training schemes, and inference algorithms
Requirements
- Strong mathematical foundation in statistical learning theory
- Deep expertise in foundation model architectures
- Proven ability to bridge theory and practice
- Demonstrated ability to build data pipelines
- Track record of designing controlled experiments
- Experience optimizing models for real-world deployment
- History of open-source contributions or research publications
Benefits
- Medical, dental, vision benefits
- Annual wellness stipend
- Mental health support
- Life, STD, LTD Income Insurance Plans
- Unlimited PTO
- Generous paid parental leave
- Flexible schedule
- 12 Paid US company holidays
- Quarterly personal productivity stipend
- One-time stipend for home office upgrades
- 401(k) plan with company match
- Tax Savings Programs
- Learning / Education stipend
- Participation in talks and conferences
- Employee Resource Groups
- AI enablement workshops / sessions
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Latent Space Modelsneural audio codecssteerable generative modelsembedding systemslatent recombinationmultimodal speech-to-speech systemsmodel architecturestraining schemesinference algorithmsstatistical learning theory
Soft Skills
ability to bridge theory and practicedesigning controlled experimentsoptimizing models for real-world deployment