FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesCloudGrafanaITSMKubernetesPrometheusPythonServiceNowTerraform
About the role
Key responsibilities & impact- Build agentic, executable workflows capable of triaging, diagnosing, and where appropriate autonomously remediating known failure patterns
- Build and maintain LLM-backed agents targeting the observability stack, ITSM platform, and infrastructure APIs (e.g. DCIM, IPAM, hypervisor layers)
- Develop auditable Client focused automations, for Client interactions and workflows, with appropriate controls
- Maintain and contribute a library of automation assets, agent prompts, and runbook-as-code artefacts, version-controlled and peer-reviewed
- Develop the automation layer around monitoring and event management: alert suppression logic, enrichment pipelines, correlation rules, and alert-to-ticket integrations
- Identify common Operational patterns and tasks as candidates for automation; maintain and prioritise a toil reduction backlog
- Participate in post-incident reviews and translate findings into updated automation, runbooks, or agent logic
Requirements
What you’ll need- Prior experience in an SRE, Senior Operations, or Platform Engineering environment
- Strong Python development skills, including scripting for automation, API integration, and data processing
- Hands-on experience with observability and monitoring platforms: Prometheus, Grafana, Mimir, or equivalent
- Experience integrating with ITSM platforms (ServiceNow, Halo, Jira Service Management, or similar) via API
- Solid understanding of event-driven architectures, message queues, and webhook-based automation patterns
- Familiarity with Infrastructure-as-Code principles and cloud-native environments (Kubernetes, Terraform, or similar)
Benefits
Comp & perks- Flexible working hours
- Professional development opportunities
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonAPI integrationdata processingevent-driven architecturemessage queueswebhook-based automationInfrastructure-as-CodeKubernetesTerraformautomation
Soft Skills
collaborationproblem-solvingcommunicationprioritizationanalytical thinking
