
Site Reliability Engineer L4 – Ads
Netflix
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇺🇸 United States
Visit company websiteSalary
💰 $100,000 - $720,000 per year
Job Level
Mid-LevelSenior
Tech Stack
AWSAzureCloudDistributed SystemsGoGoogle Cloud PlatformJavaKubernetesPythonTerraform
About the role
- Design, implement, and maintain scalable and reliable infrastructure to support Netflix Ads Suite.
- Collaborate with engineering and product teams to integrate observability, reliability, and security considerations into the software development lifecycle.
- Coordinate capacity planning for Dynamic Ad Insertion for global-scale Netflix Live streaming.
- Develop and implement automation tools for monitoring, deployment, and incident response.
- Participate in on-call rotations to ensure 24/7 health of the Netflix Ad Suite.
- Contribute to incident response, diagnosis, and resolution.
- Implement and maintain a robust incident response framework.
- Proactively identify sources of instability in distributed systems.
- Champion a culture of reliability across the Ads organization.
Requirements
- 5+ years of experience as a Site Reliability Engineer (SRE), Production Engineer, or similar role supporting business-critical, high-traffic services.
- Write code to solve problems.
- Proficient in one or more languages like Python, Go, or Java.
- Hands-on experience with cloud providers such as AWS/Azure/GCP.
- Experience with Infrastructure as Code such as Terraform.
- Knowledge of container orchestration systems like Kubernetes.
- Understand large-scale distributed systems and their common failure modes.
- Thrive on collaboration and have excellent communication skills.
- Ability to navigate complex production issues and identify root causes.
- Possess a growth mindset and be committed to continuous improvement.
Benefits
- Health Plans
- Mental Health support
- 401(k) Retirement Plan with employer match
- Stock Option Program
- Disability Programs
- Health Savings and Flexible Spending Accounts
- Family-forming benefits
- Life and Serious Injury Benefits
- paid leave of absence programs
- Full-time hourly employees accrue 35 days annually for paid time off
- Full-time salaried employees are immediately entitled to flexible time off
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Site Reliability EngineerProduction EngineerPythonGoJavaAWSAzureGCPTerraformKubernetes
Soft skills
collaborationcommunicationproblem-solvingroot cause analysisgrowth mindsetcontinuous improvement