CVS Health

Staff Observability Operations Engineer

CVS Health

full-time

Posted on:

Location Type: Hybrid

Location: ConnecticutUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $130,295 - $260,590 per year

Job Level

About the role

  • Deploy and implement modern observability solutions to meet organizational needs
  • Ensure successful integration of observability, event management, and notification tools and technologies within the existing environment
  • Work with partners to migrate legacy monitoring to modern solutions
  • Manage and administer observability and event management platforms
  • Lead system upgrades, patching, and maintenance activities to ensure optimal performance and security
  • Coordinate and manage release cycles for observability platforms
  • Troubleshoot and resolve incidents related to observability platforms
  • Continuously monitor and enhance platform performance to support scalability and complexity
  • Collaborate with cross-functional infrastructure, application, and business stakeholders to ensure observability solutions align with the broader IT strategy and infrastructure requirements

Requirements

  • 7+ Years of experience in IT operations, with significant responsibilities in system monitoring, performance tuning, and troubleshooting enterprise applications
  • 5+ Years in a Site Reliability Engineering (SRE) role deploying and managing modern observability solutions
  • 5+ Years managing and implementing observability and event management platforms (e.g., AppDynamics, Splunk, Prometheus, Grafana)
  • Experience developing and administering ServiceNow ITOM event management solutions, ensuring seamless integration with observability tools
  • Experience deploying and managing service reliability platforms (e.g., xMatters, OpsGenie, PagerDuty), configuring incident notifications, incident command workflows, and automating incident remediation workflows
  • Experience with and deep knowledge of cloud environments, cloud monitoring platforms, and container orchestration tools (e.g., AWS/CloudTrail, Azure/Monitor, GCP/GCM, Kubernetes, OpenShift)
  • Proficiency in Python and other scripting languages such as Ansible, PowerShell, and Bash for automation and configuration
  • Experience with and passion for deploying things “as code”
Benefits
  • Affordable medical plan options
  • 401(k) plan (including matching company contributions)
  • Employee stock purchase plan
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs
  • Confidential counseling and financial coaching
  • Paid time off
  • Flexible work schedules
  • Family leave
  • Dependent care resources
  • Colleague assistance programs
  • Tuition assistance
  • Retiree medical access

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
system monitoringperformance tuningtroubleshootingobservability solutionsevent managementcloud monitoringcontainer orchestrationautomationPythonscripting languages
Soft skills
collaborationleadershipproblem-solvingcommunicationorganizational skills