
Audio Algorithm Engineer – Speaker Diarization
Plaud
full-time
Location Type: Hybrid
Location: San Francisco • California • United States
Salary
💰 $150,000 - $180,000 per year
About the role
- Survey the research literature on optimizing terminology lexicons, and design practical terminology filtering and hotword optimization approaches.
- Implement multilingual hotword algorithms based on SpeechLLM and improve their performance; collaborate with the engineering team to deploy the hotword recognition solution.
- Fine-tune the speech recognition model on scenario-specific data to improve ASR accuracy across multiple languages and industries.
- Build test sets and an evaluation system for keyword recognition and industry-specific recognition engines, and benchmark open-source models and commercial APIs on terminology and industry-specific recognition.
Requirements
- 3 to 5 years of experience training speech algorithms, including fine-tuning and training SpeechLLM models.
- Experience processing hundreds of thousands of hours of speech data and training speech recognition models.
- Familiarity with SpeechLLM and speech SSL training, with experience training models similar to StepAudio, Qwen3omni, etc. from scratch. Individual contributors who were responsible for model training on teams such as the StepAudio speech group are preferred.
- Papers at top speech conferences such as Interspeech or ICASSP, or speech-related patents.
Benefits
- Top-tier healthcare for employees and dependents, including dental and vision, and a generous employer subsidy.
- 401(k) plan for full-time employees with company matching.
- Unlimited PTO, plus 13 paid holidays.
- 12 weeks of paid leave to spend with your new family, regardless of gender.
- Minimum of 3 days in the office per week.
- New hires are equipped with their choice of new top-of-the-line laptops and workstation setups.
- Best office equipment. Annual offsites. Free office drinks and snacks.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
speech algorithm training, fine-tuning, SpeechLLM, speech recognition models, hotword algorithms, ASR recognition, keyword recognition, industry recognition engines, open-source models, commercial interfaces