FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Software Engineering Group Manager – Site Reliability Center
PNCSoftware Engineering Group Manager at PNC responsible for site reliability engineering and production support. Leading high-impact teams and fostering collaborative technology organization for operational excellence.
Posted 6/26/2026full-timePittsburgh • Alabama, Arizona, Colorado, Pennsylvania, Texas • 🇺🇸 United StatesSeniorLead💰 $146,300 - $298,870 per yearWebsite
Tech Stack
Tools & technologiesCassandraElasticSearchKafkaLinuxMongoDBOracleRedisSQL
About the role
Key responsibilities & impact- Lead during moments that matter; own major incident response for high-impact (P1/P2) events, ensuring rapid resolution and clear communication.
- Provide technical leadership in production support; serve as an escalation point for complex production issues; guide troubleshooting across: applications, infrastructure (Linux/Windows), databases (Oracle, SQL), middleware and integrations;
- Drive root cause and real fixes; champion a culture of accountability through deep root cause analysis; eliminate repeat issues by driving permanent, systemic solutions and turn data and trends into actionable improvements.
- Shape reliability at scale; Define and evolve reliability strategy across availability, resiliency, and performance; lead improvements in uptime, MTTR, and overall system health and partner with engineering to embed reliability into system design.
- Modernize operations; advance observability with best-in-class monitoring, alerting, and event management; leverage tools like Dynatrace, BigPanda, and Logscale to enable proactive detection and drive automation to reduce manual effort and create self-healing systems.
- Ensure safe and reliable change; oversee change and release governance to enable fast and safe deployments; improve change success rates, reduce production defects, and lead post-release reviews that fuel continuous improvement.
- Lead a Global 24x7 Operation; manage distributed teams supporting critical systems around the clock; create seamless handoffs and strong operational discipline across regions and elevate team performance, engagement, and growth.
Requirements
What you’ll need- 8+ years of related experience and 5+ years of management experience.
- Proven leadership experience in Production Operations, SRE, or Infrastructure Engineering.
- Deep expertise in incident, problem, and change management within complex environments.
- Passion for building reliable, scalable, customer-centric platforms.
- Track record of improving operational metrics and leading high-performing teams.
- Strong executive presence and communication skills.
- Experience with OCP under infrastructure (Linux/Windows, OCP), MongoDB, Cassandra under databases (Oracle, SQL, MongoDB, Cassandra) and working knowledge of Elasticsearch, Redis, MQ and Kafka is a plus.
Benefits
Comp & perks- medical/prescription drug coverage (with a Health Savings Account feature)
- dental and vision options
- employee and spouse/child life insurance
- short and long-term disability protection
- 401(k) with PNC match
- pension and stock purchase plans
- dependent care reimbursement account
- back-up child/elder care
- adoption, surrogacy, and doula reimbursement
- educational assistance, including select programs fully paid
- a robust wellness program with financial incentives
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
incident managementproblem managementchange managementproduction supportroot cause analysisreliability engineeringmonitoringalertingautomationsystem design
Soft Skills
leadershipcommunicationoperational disciplineteam performanceengagementgrowthaccountabilitycustomer-centric focusexecutive presencetroubleshooting