Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
CARBON3

Site Reliability Engineer

CARBON3

Automation Engineer building AI-driven Operations for Era4's engineering-led Operations Centre. Designing tooling and workflows in a mission-driven start-up focusing on critical national infrastructure.

Posted 6/11/2026full-time🇬🇧 United KingdomMid-LevelSeniorWebsite

Tech Stack

Tools & technologies
CloudGrafanaITSMKubernetesPrometheusPythonServiceNowTerraform

About the role

Key responsibilities & impact
  • Build agentic, executable workflows capable of triaging, diagnosing, and where appropriate autonomously remediating known failure patterns
  • Build and maintain LLM-backed agents targeting the observability stack, ITSM platform, and infrastructure APIs (e.g. DCIM, IPAM, hypervisor layers)
  • Develop auditable Client focused automations, for Client interactions and workflows, with appropriate controls
  • Maintain and contribute a library of automation assets, agent prompts, and runbook-as-code artefacts, version-controlled and peer-reviewed
  • Develop the automation layer around monitoring and event management: alert suppression logic, enrichment pipelines, correlation rules, and alert-to-ticket integrations
  • Identify common Operational patterns and tasks as candidates for automation; maintain and prioritise a toil reduction backlog
  • Participate in post-incident reviews and translate findings into updated automation, runbooks, or agent logic

Requirements

What you’ll need
  • Prior experience in an SRE, Senior Operations, or Platform Engineering environment
  • Strong Python development skills, including scripting for automation, API integration, and data processing
  • Hands-on experience with observability and monitoring platforms: Prometheus, Grafana, Mimir, or equivalent
  • Experience integrating with ITSM platforms (ServiceNow, Halo, Jira Service Management, or similar) via API
  • Solid understanding of event-driven architectures, message queues, and webhook-based automation patterns
  • Familiarity with Infrastructure-as-Code principles and cloud-native environments (Kubernetes, Terraform, or similar)

Benefits

Comp & perks
  • Flexible working hours
  • Professional development opportunities

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonAPI integrationdata processingevent-driven architecturemessage queueswebhook-based automationInfrastructure-as-CodeKubernetesTerraformautomation
Soft Skills
collaborationproblem-solvingcommunicationprioritizationanalytical thinking