TASC

Director, Site Reliability Engineering

TASC

full-time

Posted on:

Location Type: Office

Location: PuneIndia

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Lead and develop a team of highly skilled people leaders and senior individual contributors within the CX Technology organization, fostering a culture of accountability, innovation, and continuous improvement
  • Define and drive the short-term and medium-term strategic vision for Site Reliability Engineering, aligning reliability, scalability, and operational efficiency initiatives with broader Mastercard technology and business objectives
  • Lead the design and execution of cross-functional initiatives that improve system resilience, automate operational processes, and mature incident management, problem management, and reliability engineering practices
  • Establish, evolve, and govern reliability standards, operational best practices, and control frameworks to ensure consistent adoption across engineering and delivery teams
  • Partner closely with engineering, product, architecture, and business stakeholders to embed reliability requirements into system design, development, deployment, and lifecycle management processes
  • Oversee major incident response and escalation efforts, ensuring rapid recovery, effective communication, and high-quality root cause analysis with actionable remediation
  • Promote proactive risk identification and mitigation through observability, capacity planning, resiliency testing, and automation-driven approaches
  • Champion continuous improvement by leveraging operational metrics, insights, and retrospectives to drive measurable improvements in availability, stability, and customer experience
  • Stay informed on industry trends, emerging technologies, and modern SRE practices, applying relevant innovations to advance Mastercard’s operational maturity
  • Manage goal setting, coaching, performance management, and talent development for people leaders and senior technologists, building a strong leadership pipeline and sustaining operational excellence at scale.

Requirements

  • Proven experience leading Site Reliability Engineering, Production Engineering, or large-scale operations teams within complex, highly available, distributed technology environments
  • Strong people leadership background, including managing managers and/or senior technical leaders, with demonstrated success building high-performing, inclusive teams
  • Deep understanding of reliability engineering principles, including incident management, automation, telecom, observability, resilience engineering, capacity planning, and service lifecycle management
  • Demonstrated ability to translate strategy into execution by evolving processes, programs, and policies to drive meaningful and measurable operational improvements
  • Experience partnering across engineering, product, and business functions to influence design decisions and embed reliability throughout the development lifecycle
  • Strong analytical and problem-solving skills, with a track record of driving root cause analysis and long-term corrective actions
  • Excellent communication and stakeholder management skills, with the ability to lead through influence at senior and executive levels
  • Passion for continuous improvement, operational discipline, and leveraging technology to reduce toil and improve system outcomes at scale
  • Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience; advanced degree preferred.
Benefits
  • Must abide by Mastercard’s security policies and practices
  • Ensure the confidentiality and integrity of the information being accessed
  • Report any suspected information security violation or breach
  • Complete all periodic mandatory security trainings
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Site Reliability Engineeringincident managementautomationobservabilityresilience engineeringcapacity planningservice lifecycle managementroot cause analysisoperational metricscontinuous improvement
Soft Skills
people leadershipcoachingperformance managementstakeholder managementanalytical skillsproblem-solvingcommunicationinfluenceteam buildingoperational discipline
Certifications
Bachelor’s degree in Computer ScienceBachelor’s degree in Engineeringadvanced degree preferred