FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesAzureCloudGoogle Cloud PlatformGrafanaLinux
About the role
Key responsibilities & impact- Manage operational delivery and stakeholder relationships for assigned clients.
- Provide specialized consulting advice based on engagement requirements and support executive-level decision-making.
- Lead and mentor a **team of SRE engineers (Level 7 ICs)**
- Own **end-to-end incident management operations** across regions
- Establish and drive **incident response processes and governance**
- Ensure effective **16x7 delivery model across geographies**
- Act as escalation point for **critical incidents and stakeholder communication**
- Drive continuous improvements in **MTTM and operational efficiency**
- Lead **process enhancements, SOP creation, and knowledge transfer planning**
Requirements
What you’ll need- Strong experience in **SRE / Incident Management leadership**
- Proven ability to manage **high-impact, complex incidents**
- Excellent **communication, stakeholder management, and leadership skills**
- Ability to **drive alignment and influence cross-functional teams**
- Ability to proactively identify risks using monitoring tools such as DataDog and Grafana dashboards
- Experience in incident response with capability to quickly restore services (restart, patch, or remediate live issues)
- Strong focus on minimizing service downtime across environments
- Hands-on experience supporting both on-premise (Linux environments) and cloud platforms (primarily Azure, with some exposure to GCP)
- Solid understanding of networking concepts and system architecture
Benefits
Comp & perks- Strong experience in **SRE / Incident Management leadership**
- Proven ability to manage **high-impact, complex incidents**
- Excellent **communication, stakeholder management, and leadership skills**
- Ability to **drive alignment and influence cross-functional teams**
- Ability to proactively identify risks using monitoring tools such as DataDog and Grafana dashboards
- Experience in incident response with capability to quickly restore services (restart, patch, or remediate live issues)
- Strong focus on minimizing service downtime across environments
- Hands-on experience supporting both on-premise (Linux environments) and cloud platforms (primarily Azure, with some exposure to GCP)
- Solid understanding of networking concepts and system architecture
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
SREincident managementMTTMprocess enhancementsSOP creationknowledge transfer planningLinuxAzureGCPnetworking concepts
Soft Skills
communicationstakeholder managementleadershiprisk identificationinfluencecross-functional collaborationmentoringcontinuous improvementescalation managementoperational delivery
