Senior Staff Production Engineer

Sr. Staff Production Engineer at Zscaler implementing scalable multi-cloud infrastructure.

Posted 4/21/2026full-timeSan Jose • California • 🇺🇸 United StatesSenior💰 $140,000 - $200,000 per yearWebsite

Tools & technologies

AWSAzureGoGoogle Cloud PlatformGrafanaLinuxPrometheusPython

Key responsibilities & impact

Design and implement highly available, scalable infrastructure across AWS, Azure, GCP, and bare-metal environments
Drive an "automation-first" culture by writing code (Python/Go) to eliminate manual toil and build self-healing systems
Implement and maintain sophisticated observability (Prometheus, Grafana, OpenTelemetry), define SLIs/SLOs, and establish error budgets
Act as a lead Incident Commander (TDO on-call), develop response playbooks, and conduct deep-dive post-incident analyses
Partner with Engineering and partner teams to conduct operability reviews

What you’ll need

8+ years of experience managing reliability, scalability, and availability for large-scale production services
Deep expertise in programming (e.g., Python, Go, or C/C++)
Strong background in networking protocols, Linux/FreeBSD systems, and distributed architecture
Experience in high-stakes incident management and participation in a 24/7 on-call rotation
Proficiency in leveraging ITIL frameworks and incident data to drive service maturity through systematic problem management and technical operability reviews

Comp & perks

ATS Keywords

✓ Tailor your resume

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

PythonGoC/C++AWSAzureGCPPrometheusGrafanaOpenTelemetryLinux

Soft Skills

leadershipincident managementcommunicationcollaborationproblem management