Oversee all aspects of the Zscaler production data center services, including servers, operating systems, storage, and supporting systems
Be an integral part of the SRE team responsible for ensuring the cloud's availability, latency, performance, efficiency, monitoring, and emergency response
Work closely with Software Engineering, Development, and Infrastructure teams to understand, implement, and deploy end-to-end monitoring solutions
Deploy patches and upgrades, and update administrative tools and utilities as required
Monitor applications and services within the environments, participate in on-call rotations, take action to resolve issues, and implement strategies to prevent future occurrences
Requirements
2+ years of experience in 24/7 SRE/NOC operations, production cloud platforms, and related automation workflows. US citizenship is required.
Strong technical foundation, including proficiency in Linux/UNIX, familiarity with programming languages like Python and Go, and scripting with Bash
Solid understanding of networking fundamentals, including HTTP, DNS, TCP/IP, ICMP, the OSI Model, subnetting, and load balancing
Adaptable to fast-paced environments with the flexibility to support after-hours and weekend deployments or releases
Passion for improving processes, optimizing workflows, and solving technical challenges to keep the cloud platform resilient and meet demanding SLAs
Benefits
Various health plans
Time-off plans for vacation and sick leave
Parental leave options
Retirement options
Education reimbursement
In-office perks, and more!