Netflix

Site Reliability Engineer L5 – Live SRE

Netflix

full-time

Posted on:

Location Type: Remote

Location: United States

Visit company website

Explore more

AI Apply
Apply

About the role

  • Support live streaming events by focusing on cloud traffic (API Gateway, IPC between microservices).
  • Prepare and execute various load tests to ensure infrastructure can handle sudden API traffic increases.
  • Implement end-to-end observability and visualize data to achieve desired availability at scale.
  • Drive continual improvement in observability, monitoring, and scalability.
  • Implement, automate, execute, and analyze results from live streaming delivery focused tests.
  • Write and review code, develop documentation, and debug complex problems.
  • Coordinate and collaborate across multiple stakeholders for smooth event execution.
  • Participate in an on-call rotation and work flexible hours based on event schedules.

Requirements

  • 5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on traffic at scale.
  • Knowledge of and proven experience with L4 Load Balancer, HTTP cache, and reverse proxy technologies.
  • Expert-level knowledge of Unix or Linux systems and TCP/IP network fundamentals.
  • Proficient understanding of networking principles, transport, and application protocols, especially DNS, TLS, and HTTP(s) etc.
  • Proficient in a programming language such as Go, Python, Rust etc.
  • Experience with using real time and BigData analytics processing technologies (Kafka, time series database and Presto/Trino, Spark SQL etc)
  • Ability to work in a highly collaborative environment and to communicate effectively with internal and external partners.
  • Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience).
Benefits
  • Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates.
  • We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
API Gatewayload testingobservabilitymonitoringscalabilityUnixLinuxTCP/IPGoPython
Soft skills
collaborationcommunication