FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesAzureCloud
About the role
Key responsibilities & impact- Define and manage SLIs, SLOs, Error Budgets, MTTR, change failure rate, and availability targets.
- Continuously improve platform reliability, scalability, resilience, and operational maturity.
- Lead Sev-1 / Sev-2 incident management, escalation handling, and RCA reviews.
- Conduct blameless postmortems and drive preventive actions.
- Build operational runbooks, self-healing automation, and on-call processes.
- Participate in architecture reviews for HA, DR, failover, and performance optimization.
Requirements
What you’ll need- 8 - 10 years in SRE, DevOps, Cloud Engineering, or Production Operations.
- Minimum 5+ years hands-on with Microsoft Azure production environments.
- Proven experience managing critical enterprise workloads.
- Strong customer-facing / managed services background preferred.
Benefits
Comp & perks- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Remote work options
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
SLIsSLOsError BudgetsMTTRchange failure rateavailability targetsself-healing automationarchitecture reviewsHADR
Soft Skills
incident managementescalation handlingRCA reviewsblameless postmortemspreventive actions
