Enova International

Senior Site Reliability Engineer – Incident Management, Resiliency

Enova International

full-time

Posted on:

Location Type: Hybrid

Location: ChicagoIllinoisUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $85,000 - $125,000 per year

Job Level

About the role

  • Lead production incidents as part of our PI PIC (or Incident Commander) rotation after completing training, ensuring clear communication and resolution.
  • Capture and maintain detailed documentation of incidents, contributing factors, and learnings in formal incident reports.
  • Deliver documentation that is clear, comprehensive, and accessible to different types of audiences in a timely manner within the established SLAs.
  • Facilitate and document blameless post-incident reviews that promote learning and continuous improvement.
  • Collect and analyze incident data in order to identify systemic issues, risks, and trends. Lead incident data reviews in front of a wide range of stakeholders, including technical and business leadership
  • Work on improvements to how we collect, analyze, and learn from system failures.
  • Champion a culture of operational excellence and resilience across the organization.
  • Collaborate with engineering, product, and operations teams to address vulnerabilities and build more resilient systems.
  • Design and run failure simulations (e.g., mock incidents, disaster recovery exercises) to proactively identify weak points.

Requirements

  • 5+ Experience in a technology or analyst role (e.g., Software Engineering, Systems, Operations, SRE, or Product).
  • A strong interest in how complex distributed systems operate—and how to make them more reliable.
  • Analytical and problem-solving skills with a systems-thinking mindset.
  • Strong communication skills, both verbal and written, with the ability to tailor messaging to technical and non-technical audiences.
  • Experience querying and analyzing data (e.g., SQL, PostgreSQL, Kafka).
  • Comfort with ambiguity, and the ability to turn vague problems into actionable insights.
  • Demonstrated maturity, sound judgment, and organizational awareness.
  • Ability to coordinate the resolution of major incidents and reviews following Enova Incident Management Process
  • Ability to seamlessly shift between high-urgency incident response and structured project work, with strong organizational skills and the capacity to manage projects independently.
Benefits
  • Health, dental, and vision insurance including mental health benefits
  • 401(k) matching plus a roth option (U.S. Based employees only)
  • PTO & paid holidays off
  • Sabbatical program (for eligible roles)
  • Summer hours (for eligible roles)
  • Paid parental leave
  • DEI groups (B.L.A.C.K. @ Enova, HOLA @ Enova, Women @ Enova, Pride @ Enova, South Asians @ Enova, APEX @ Enova, and Parents @ Enova)
  • Employee recognition and rewards program
  • Charitable matching and a paid volunteer day…Plus so much more!

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
incident managementdata analysisSQLPostgreSQLKafkasystems thinkingfailure simulationsdisaster recoverydocumentationincident reporting
Soft skills
analytical skillsproblem-solvingcommunication skillsorganizational skillsadaptabilityjudgmentcollaborationleadershipresiliencecontinuous improvement