Tech Stack
Tools & technologies: AWS, Go, Kubernetes, Python, Splunk, Terraform
About the role
Key responsibilities & impact
- combine deep technical expertise with team leadership to drive reliability
- lead other SREs in solving complex operational challenges
- establish technical standards and serve as an advisor to engineering leadership
- lead cross-functional reliability initiatives
- architect enterprise-scale infrastructure solutions
- establish Service Level Objectives (SLOs)
- lead major incident response as incident commander
- drive strategic improvements to observability
- evaluate and introduce new technologies
Requirements
What you’ll need
- 6-10 years of experience in Site Reliability Engineering (or equivalent)
- Proven ability to lead technical teams
- Expert-level knowledge of AWS
- Deep Kubernetes expertise
- Mastery of Infrastructure as Code using Terraform
- Strong software engineering background with production experience in Python and/or Go
- Extensive experience with observability platforms (Datadog, Splunk)
- Deep understanding of CI/CD principles
- Proven track record leading major incidents
Benefits
Comp & perks
- flexible work environment
- fluid career paths
- celebrating internal mobility
- purpose and well-being recognition
- work-life balance initiatives
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Site Reliability Engineering, AWS, Kubernetes, Infrastructure as Code, Terraform, Python, Go, observability platforms, Datadog, Splunk
Soft Skills
team leadership, problem solving, technical standards establishment, cross-functional collaboration, incident response leadership, strategic improvement
