Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
CVS Health

Principal AIOps Engineer

CVS Health

Principal AIOps Engineer at CVS Health focusing on modernizing IT operations through observability and machine learning. Leading strategy and integrations to improve operational efficiency.

Posted 4/30/2026full-timeRemote • Pennsylvania • 🇺🇸 United StatesLead💰 $144,200 - $288,400 per yearWebsite

Tech Stack

Tools & technologies
CloudDistributed SystemsDNSGrafanaITSMLinuxPrometheusPythonServiceNowSplunkTCP/IP

About the role

Key responsibilities & impact
  • Lead the AIOps strategy, roadmap, and operating model to measurably improve MTTR, alert quality, and operational efficiency
  • Own the observability-to-AIOps pipeline and drive standardization of telemetry, service health models, and actionable alerting
  • Design and implement event intelligence: correlation, deduplication, suppression, anomaly detection, incident clustering, and probable-cause analysis
  • Advise operations, service owners, and leadership stakeholders; lead change enablement, adoption, and value measurement for AIOps
  • Develop ServiceNow-centric AIOps integrations: event ingestion, alert-to-incident policies, enrichment, assignment/routing
  • Establish governance for operational AI in partnership with security, compliance, and operations
  • Build and operationalize agentic AI workflows for incident triage and resolution
  • Enable closed-loop automation and self-healing by connecting AIOps detections to orchestrated actions
  • Partner with NOC/SOC, infrastructure, and application owners to onboard services into AIOps
  • Create enablement materials and coach teams on AIOps practices, agentic AI usage, and responsible automation

Requirements

What you’ll need
  • 10+ years of experience in SRE, production operations supporting highly available services
  • Proven technical leadership: ability to set direction, lead cross-team initiatives, and advise stakeholders through architecture reviews
  • Strong programming/scripting skills (Python preferred) and experience building automation, integrations, and APIs
  • Experience integrating observability platforms and event sources across hybrid environments (cloud/on-prem) and operating production-grade monitoring/event management at scale
  • Strong ServiceNow experience as an ITSM system of record
  • Ability to build and operate integrations at scale (REST, webhooks, event management) to support automation and auditability
  • Automation & Integration Engineering: Python (preferred) for automation and data/ML pipelines
  • Experience building integrations, services, and operational tooling
  • AIOps, ITSM/ITOM (ServiceNow) & Agentic AI Ecosystem: Observability: Prometheus/Grafana, OpenTelemetry, ELK/Splunk/Datadog (or equivalent)
  • Strong Linux and networking fundamentals (TCP/IP, DNS, TLS, load balancing) and ability to troubleshoot distributed systems end-to-end
  • Excellent communication skills.

Benefits

Comp & perks
  • medical, dental, and vision coverage
  • paid time off
  • retirement savings options
  • wellness programs
  • other resources, based on eligibility

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AIOpsPythonautomationintegrationsAPIsobservabilityevent managementServiceNowLinuxnetworking
Soft Skills
technical leadershipcommunicationstakeholder advisingchange enablementcoachingcross-team collaborationvalue measurementincident triageproblem-solvingoperational efficiency