
Director, Site Reliability Engineering
Diabetes Youth Families
full-time
Location Type: Hybrid
Location: California • Massachusetts • United States
Salary
💰 $188,300 - $282,500 per year
About the role
- Provide strategic direction for the organization-wide adoption, evolution, and maturity of SRE principles
- Develop and oversee automation strategies, tools, and frameworks that improve system reliability
- Architect and evolve robust observability, monitoring, and alerting systems
- Lead and govern high‑severity incident response practices
- Analyze reliability, performance, and capacity metrics to drive proactive optimization
- Partner with engineering, product, and operations teams to embed SRE practices throughout the development lifecycle
- Build, mentor, and develop a high‑performing SRE organization
- Oversee capacity planning, scalability assessments, and future‑state demand forecasting across critical systems
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field
- 16 years of experience in the field, including 6+ years in Site Reliability Engineering, DevOps, or a similar role
- Expertise with observability and monitoring platforms such as Datadog, Prometheus, Dynatrace, Grafana, ELK, or similar
- Strong proficiency in programming languages such as Python, Go, or Java
- Deep understanding of cloud platforms (AWS, Azure, GCP) and container orchestration technologies (Docker, Kubernetes)
- Advanced knowledge of AWS services including VPC, Lambda, IAM, ELB, EC2, ECS, CloudWatch, API Gateway, S3, SQS, SNS, WAF, and Route53
- Hands-on experience with infrastructure‑as‑code tools such as Terraform, Ansible, or equivalents
- Strong understanding of security best practices, compliance frameworks, and implementation of security controls
- Experience with chaos engineering, resilience testing, and failure‑injection methodologies
Benefits
- Medical, dental, and vision insurance
- 401(k) with company match
- Paid time off (PTO)
- Additional employee wellness programs
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Site Reliability Engineering, DevOps, Python, Go, Java, AWS, Azure, GCP, Terraform, Ansible
Soft Skills
strategic direction, mentoring, leadership, collaboration, incident response, optimization, capacity planning, scalability assessments, communication, governance