
Site Reliability Engineer
Zoox
full-time
Posted on:
Location Type: Hybrid
Location: Foster City • California • United States
Visit company websiteExplore more
Salary
💰 $170,000 - $205,000 per year
Tech Stack
About the role
- Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
- Optimize system performance, reliability, and scalability.
- Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
- Collaborate with software engineering teams to improve software architecture, deployment processes, and automation.
- Conduct root cause analysis of production issues and implement corrective actions.
- Implement disaster recovery and business continuity plans.
Requirements
- 5+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
- Proven experience with cloud platforms such as AWS, GCP, or Azure.
- Expertise in container orchestration technologies like Kubernetes.
- Deep understanding of networking, storage, and database technologies.
- Strong programming skills in languages such as Python, Go, C/C++, or Java.
- Experience with infrastructure as code tools such as Terraform, Ansible, Salt, or CloudFormation.
Benefits
- paid time off (e.g. sick leave, vacation, bereavement)
- unpaid time off
- Zoox Stock Appreciation Rights
- Amazon RSUs
- health insurance
- long-term care insurance
- long-term and short-term disability insurance
- life insurance
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
site reliability engineeringlarge-scale distributed systemscloud platformsAWSGCPAzureKubernetesnetworking technologiesstorage technologiesdatabase technologies
Soft Skills
collaborationproblem-solvingroot cause analysiscommunication