
Software Engineer L5, Offline Inference, Machine Learning Platform
Netflix
full-time
Location Type: Remote
Location: Remote • California • 🇺🇸 United States
Salary
💰 $100,000 - $720,000 per year
Job Level
Mid-Level • Senior
Tech Stack
AWS • Cloud • Docker • Java • Kubernetes • Python • Scala
About the role
- Build developer-friendly APIs, SDKs, and CLIs that let researchers and engineers—experts and non-experts alike—submit and manage batch inference jobs with minimal effort, particularly in the domain of content and media.
- Design, implement, and operate distributed services that package, schedule, execute, and monitor batch inference workflows at massive scale.
- Instrument the platform for reliability, debuggability, observability, and cost control; define SLOs and share an equitable on-call rotation.
- Foster a culture of engineering excellence through design reviews, mentorship, and candid, constructive feedback.
Requirements
- Hands-on experience with ML engineering or production systems involving training or inference of deep-learning models.
- Proven track record of operating scalable infrastructure for ML workloads (batch or online).
- Proficiency in one or more modern backend languages (e.g. Python, Java, Scala).
- Production experience with containerization & orchestration (Docker, Kubernetes, ECS, etc.) and at least one major cloud provider (AWS preferred).
- Commitment to operational best practices—observability, logging, incident response, and on-call excellence.
- Excellent written and verbal communication skills; effective collaboration across distributed teams and time zones.
- Comfort working in a team whose peers and partners are distributed across US geographies and time zones.
Benefits
- Health Plans
- Mental Health support
- 401(k) Retirement Plan with employer match
- Stock Option Program
- Disability Programs
- Health Savings and Flexible Spending Accounts
- Family-forming benefits
- Life and Serious Injury Benefits
- Paid leave of absence programs
- Full-time hourly employees accrue 35 days of paid time off annually, to be used for vacation, holidays, and sick time.
- Full-time salaried employees are immediately entitled to flexible time off.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
ML engineering • deep-learning models • scalable infrastructure • backend languages • Python • Java • Scala • containerization • orchestration • cloud provider
Soft skills
communication skills • collaboration • mentorship • feedback • engineering excellence