
SRE Lead, Observability
Parts Town
full-time
Posted on:
Location Type: Hybrid
Location: Addison • Illinois • United States
Visit company websiteExplore more
Salary
💰 $99,133 - $133,784 per year
Job Level
Tech Stack
About the role
- Own enterprise observability using Dynatrace across cloud, on-prem, ERP, WMS, eCommerce, APIs, and integrations
- Design service topology, dashboards, alerts, and health indicators that reflect business impact
- Apply SRE principles (SLIs, SLOs, error budgets where appropriate) to reduce incidents and improve resilience
- Accelerate incident detection and root-cause analysis; lead post-incident reviews focused on systemic fixes
- Identify reliability, performance, and capacity risks before they impact the business
- Define observability and SRE standards and enable teams to use them effectively
Requirements
- 7+ years in infrastructure, platform, operations, or reliability engineering
- Hands-on experience implementing and operating Dynatrace
- Strong understanding of distributed systems, cloud/hybrid environments, and integrations
- Practical experience with SRE or reliability engineering concepts
- Comfortable operating in high-impact incident and production environments
Benefits
- Quarterly profit-sharing bonus
- Hybrid Work schedule
- Team member appreciation events and recognition programs
- Volunteer opportunities
- Monthly IT stipend
- Casual dress code
- On-demand pay options: Access your pay as you earn it, to cover unexpected or even everyday expenses
- Health insurance
- 401k/401k match
- Employee assistance programs
- Paid time off
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
DynatraceSRE principlesSLIsSLOserror budgetsincident detectionroot-cause analysisservice topologydashboardshealth indicators
Soft Skills
leadershipproblem-solvingcommunicationcollaborationanalytical thinking