
Senior Site Reliability Engineer
Ro
full-time
Posted on:
Location Type: Hybrid
Location: New York City • New York • United States
Visit company websiteExplore more
Salary
💰 $211,700 - $292,000 per year
Job Level
About the role
- Design and implement resilient infrastructure to support high availability at scale
- Build and contribute to tools and platforms that streamline deployment, monitoring and recovery of systems
- Drive incident response and harness learnings, leading efforts to minimize downtime and improve MTTR
- Partner with engineering teams to bake best practices for reliability, resilience and observability into services
- Automate infrastructure workflows using IaC and other cloud native tools
- Contribute to our culture of operational excellence, guiding engineers through reliability practices and raising the bar across the engineering org
Requirements
- Strong understanding of systems and infrastructure, with experience operating distributed services in production. We are mostly in AWS and leverage a lot of its primitives - EKS, RDS, Route53, S3, Elasticache to name a few
- Strong programming and automation skills using Go or Python
- Proficiency with infrastructure as code - Terraform / Pulumi
- A passion for observability, with hands-on experience in metrics, logging, tracing using Datadog
- Solid cross-functional communication, able to collaborate with product, platform, security and other teams
- An operational mindset that puts reliability and resilience as a core product requirement
- A mission-driven attitude, motivated by the opportunity to make healthcare better.
Benefits
- Full medical, dental, and vision insurance + OneMedical membership
- Healthcare and Dependent Care FSA
- 401(k) with company match
- Flexible PTO
- Wellbeing + Learning & Growth reimbursements
- Paid parental leave + Fertility benefits
- Pet insurance
- Student loan refinancing
- Virtual resources for mindfulness, counseling, and fitness
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
infrastructure designhigh availabilityincident responseinfrastructure as codeGoPythonTerraformPulumiobservabilitymetrics
Soft Skills
cross-functional communicationcollaborationoperational mindsetmission-driven attitude