FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Lead Site Reliability Developer – CSRE Consulting
TicketmasterLead Site Reliability Engineer at Ticketmaster ensuring reliability and resilience across teams and domains. Collaborate with various stakeholders to drive improvements in operational practices and team capabilities.
Tech Stack
Tools & technologiesAWSDistributed SystemsKubernetes
About the role
Key responsibilities & impact- Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes.
- Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines.
- Align product, platform, and engineering stakeholders on reliability targets and trade-offs using SLOs and error budgets.
- Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned.
- Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents.
- Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on-call practices.
- Design and implement reusable reliability mechanisms, templates, and tooling that can be adopted across teams.
- Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety.
- Drive observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards.
- Lead complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification.
- Lead reliability-focused design and code reviews and guide teams toward simpler, safer architectures.
- Mentor Senior engineers and other consultants through pairing, reviews, and structured coaching to multiply impact.
- Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption.
- Improve CSRE Consulting playbooks and operating practices based on repeated patterns observed across teams.
Requirements
What you’ll need- Deep practical understanding of SRE principles, including SLO governance and error budget policy in practice.
- Proven ability to lead cross-team technical work and influence without authority.
- Strong experience designing and troubleshooting distributed systems with cross-service failure modes.
- Experience shaping observability and alerting strategy and improving operational signal quality.
- Strong Kubernetes and AWS experience, including governance and cost trade-offs.
- Ability to design reliability automation and tooling that is reusable and adopted by multiple teams.
- Experience leading production readiness and resilience practices, including DR validation and controlled testing.
- Strong software engineering fundamentals with the ability to deliver and review high-quality changes in enterprise codebases.
- Advanced incident analysis skills focused on systemic risk reduction and organizational learning.
- Excellent communication skills, including exec-ready summaries and clear technical diagrams.
Benefits
Comp & perks- Medical, vision, dental and mental health benefits for you and your family, with access to a health care concierge, and Flexible or Health Savings Accounts (FSA or HSA)
- Free concert tickets, generous paid time off including paid holidays, sick time, and personal days
- 401(k) program with company match, stock reimbursement program
- New parent programs including caregiver leave, plus fertility, adoption, foster, or surrogacy support
- Career and skill development programs with School of Live, tuition reimbursement, and student loan repayment
- Volunteer time off, crowdfunding match
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
SRE principlesSLO governanceerror budget policydistributed systemsobservability strategyalerting strategyKubernetesAWSreliability automationproduction readiness
Soft Skills
leadershipinfluence without authoritycommunicationmentoringcollaborationproblem-solvingincident analysisorganizational learningcoachingstakeholder alignment