Ping Identity

Senior Manager, Site Reliability Engineering

Full-time

Location Type: Remote

Location: United States

Salary

💰 $166,737 - $235,000 per year

About the role

  • Provide leadership and mentorship to a team of 8-10 Site Reliability Engineers (SREs).
  • Possess expertise in defining, measuring, and reporting on key Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure adherence to the 99.99%+ uptime Service Level Agreement (SLA).
  • Collaborate effectively with other SRE, Security, and Development teams.
  • Define and implement processes to ensure the team efficiently meets target deadlines.
  • Drive the successful completion of large-scale projects, coordinating with multiple Development teams.
  • Conduct thorough capacity analysis and planning.
  • Effectively manage and scale infrastructure by establishing and adhering to automation standards.
  • Analyze and resolve complex system behavior, performance, and application issues.
  • Oversee comprehensive observability and analysis across multiple datacenters.

Requirements

  • Minimum of six years of experience leading a software-focused Site Reliability Engineering (SRE) team of eight to ten staff.
  • Demonstrated experience working within organizations operating on a global scale.
  • Proven ability to drive strategic decisions regarding "build vs. buy" technology choices.
  • Proficiency in developing, maintaining, and administering modern infrastructure tooling, with a strong emphasis on Infrastructure as Code (IaC) principles.
  • Experience provisioning public cloud resources utilizing IaC tools such as CloudFormation and Terraform.
  • Solid knowledge of scripting and programming standards (e.g., Python, Ruby, Bash, Go).
  • Experience with Docker and container orchestration platforms (e.g., Kubernetes).
  • Practical experience using Git in a large-scale team environment.
  • Understanding and application of security design principles.
  • Experience operating within a high-volume or mission-critical production service environment.
  • Expertise in IP networking, including familiarity with network functionality, operational procedures, and failure modes.

Benefits

  • Generous PTO & Holiday Schedule
  • Parental Leave
  • Progressive Healthcare Options
  • Retirement Programs
  • Opportunity for Education Reimbursement
  • Commuter Offset (specific locations)

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

Site Reliability Engineering, Service Level Objectives, Service Level Indicators, Infrastructure as Code, CloudFormation, Terraform, Python, Ruby, Bash, Go

Soft Skills

leadership, mentorship, collaboration, project management, capacity analysis, problem solving, communication, strategic decision making