Parts Town

SRE Lead, Observability

Full-time

Location Type: Hybrid

Location: Addison, Illinois, United States

Salary

💰 $99,133 - $133,784 per year

About the role

  • Own enterprise observability using Dynatrace across cloud, on-prem, ERP, WMS, eCommerce, APIs, and integrations
  • Design service topology, dashboards, alerts, and health indicators that reflect business impact
  • Apply SRE principles (SLIs, SLOs, error budgets where appropriate) to reduce incidents and improve resilience
  • Accelerate incident detection and root-cause analysis; lead post-incident reviews focused on systemic fixes
  • Identify reliability, performance, and capacity risks before they impact the business
  • Define observability and SRE standards and enable teams to use them effectively

Requirements

  • 7+ years in infrastructure, platform, operations, or reliability engineering
  • Hands-on experience implementing and operating Dynatrace
  • Strong understanding of distributed systems, cloud/hybrid environments, and integrations
  • Practical experience with SRE or reliability engineering concepts
  • Comfortable operating in high-impact incident and production environments

Benefits

  • Quarterly profit-sharing bonus
  • Hybrid work schedule
  • Team member appreciation events and recognition programs
  • Volunteer opportunities
  • Monthly IT stipend
  • Casual dress code
  • On-demand pay options: access your pay as you earn it to cover unexpected or everyday expenses
  • Health insurance
  • 401(k) plan with match
  • Employee assistance programs
  • Paid time off

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Dynatrace, SRE principles, SLIs, SLOs, error budgets, incident detection, root-cause analysis, service topology, dashboards, health indicators

Soft Skills
leadership, problem-solving, communication, collaboration, analytical thinking