
Staff Observability Operations Engineer
CVS Health
full-time
Posted on:
Location Type: Hybrid
Location: Connecticut • United States
Visit company websiteExplore more
Salary
💰 $130,295 - $260,590 per year
Job Level
Tech Stack
About the role
- Deploy and implement modern observability solutions to meet organizational needs
- Ensure successful integration of observability, event management, and notification tools and technologies within the existing environment
- Work with partners to migrate legacy monitoring to modern solutions
- Manage and administer observability and event management platforms
- Lead system upgrades, patching, and maintenance activities to ensure optimal performance and security
- Coordinate and manage release cycles for observability platforms
- Troubleshoot and resolve incidents related to observability platforms
- Continuously monitor and enhance platform performance to support scalability and complexity
- Collaborate with cross-functional infrastructure, application, and business stakeholders to ensure observability solutions align with the broader IT strategy and infrastructure requirements
Requirements
- 7+ Years of experience in IT operations, with significant responsibilities in system monitoring, performance tuning, and troubleshooting enterprise applications
- 5+ Years in a Site Reliability Engineering (SRE) role deploying and managing modern observability solutions
- 5+ Years managing and implementing observability and event management platforms (e.g., AppDynamics, Splunk, Prometheus, Grafana)
- Experience developing and administering ServiceNow ITOM event management solutions, ensuring seamless integration with observability tools
- Experience deploying and managing service reliability platforms (e.g., xMatters, OpsGenie, PagerDuty), configuring incident notifications, incident command workflows, and automating incident remediation workflows
- Experience with and deep knowledge of cloud environments, cloud monitoring platforms, and container orchestration tools (e.g., AWS/CloudTrail, Azure/Monitor, GCP/GCM, Kubernetes, OpenShift)
- Proficiency in Python and other scripting languages such as Ansible, PowerShell, and Bash for automation and configuration
- Experience with and passion for deploying things “as code”
Benefits
- Affordable medical plan options
- 401(k) plan (including matching company contributions)
- Employee stock purchase plan
- No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs
- Confidential counseling and financial coaching
- Paid time off
- Flexible work schedules
- Family leave
- Dependent care resources
- Colleague assistance programs
- Tuition assistance
- Retiree medical access
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
system monitoringperformance tuningtroubleshootingobservability solutionsevent managementcloud monitoringcontainer orchestrationautomationPythonscripting languages
Soft skills
collaborationleadershipproblem-solvingcommunicationorganizational skills