
Senior Service Reliability Engineer
Fitch Group, Inc.
full-time
Location Type: Hybrid
Location: Manchester • United Kingdom
About the role
- You’ll lead the delivery of reliable, scalable, mission-critical services.
- You’ll guide squads on Kubernetes and modern deployment patterns.
- You’ll mentor associate engineers and set best practices in your areas of expertise, helping the team grow in knowledge and capability.
- Partner closely with Fitch Ratings Development Squads and Operations to design and advance service builds and DevOps tooling, and to drive operational excellence.
- Partner with Core Engineering to architect and govern GitHub Actions CI/CD with quality gates, canary/blue‑green strategies, and AI‑assisted redeploy checks.
- Own observability in Datadog—define SLIs/SLOs, dashboards, alerting, and MS Teams integrations—and reduce incidents via telemetry-driven automation and blameless postmortems.
- Champion AI‑enabled operations using AWS Bedrock/SageMaker and Model Context Protocol (MCP) for log analysis, anomaly detection, incident triage, and workflow orchestration; establish adoption guardrails.
- Define and enforce cloud guardrails and security controls (SCPs/IAM boundaries, OPA policies, tagging, centralized logging with AWS Config/CloudTrail/Security Hub) in partnership with Security and Risk.
- Influence cross‑functional roadmaps, lead complex release planning, and drive strategic platform initiatives across CI&PE; serve as an escalation point and participate in the L3 on‑call rotation.
Requirements
- You have deep, hands-on experience in SRE, DevOps, or Platform Engineering across both AWS and Azure, with a strong track record operating Docker and Kubernetes in production environments.
- You’re highly proficient in administering both Linux and Windows, and have practical, enterprise-level experience supporting IIS/.NET applications as well as Java Spring Boot services.
- You have built and maintained CI/CD pipelines (primarily GitHub Actions; Bamboo experience a plus) with DevSecOps principles baked in—integrating security scans, policy-as-code, and compliance gates—and script confidently in Python, PowerShell, or Bash.
- You have experience with cloud security best practices (IAM, secrets management, container/image scanning) and understand core infrastructure fundamentals (networking, storage, DNS) and APM/telemetry tooling.
Benefits
- Hybrid Work Environment: 2 to 3 days a week in office required based on your line of business and location
- A Culture of Learning & Mobility: Dedicated trainings, leadership development and mentorship programs designed to ensure that your time at Fitch will be a continuous learning opportunity
- Investing in Your Future: Retirement planning and tuition reimbursement programs that empower you to achieve your short and long-term goals
- Promoting Health & Wellbeing: Comprehensive healthcare offerings that enable physical, mental, financial, social, and occupational wellbeing
- Supportive Parenting Policies: Family-friendly policies, including a generous global parental leave plan, designed to help you balance career and family life effectively
- Inclusive Work Environment: A collaborative workplace where all voices are valued, with Employee Resource Groups that unite and empower our colleagues around the globe
- Dedication to Giving Back: Paid volunteer days, matched funding for donations and ample opportunities to volunteer in your community
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Kubernetes, AWS, Azure, Docker, Linux, Windows, IIS, .NET, Java Spring Boot, Python
Soft Skills
mentoring, leadership, collaboration, communication, strategic planning, influencing, problem-solving, operational excellence, team growth, cross-functional teamwork