FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Site Reliability and DevOps Engineering Lead
MerativePlatform Reliability & DevOps Engineering Lead at Micromedex by Merative ensuring high availability of clinical decision support systems. Leading DevOps strategy and team with focus on operational excellence.
Posted 6/17/2026full-timeRemote • California • 🇺🇸 United StatesSenior💰 $131,381 - $197,072 per yearWebsite
Tech Stack
Tools & technologiesJavaPython
About the role
Key responsibilities & impact- Lead, mentor, and grow Platform / DevOps engineers
- Build a high-performing Platform team
- Drive accountability for platform reliability and delivery outcomes
- Lead vendors to deliver capabilities in production.
- Ensure platform capabilities accelerate product delivery, remove bottlenecks.
- Defines and enforces platform engineering standards and DevOps practices across all teams and vendors
- Lead capacity planning, performance optimization, and cost efficiency
- Define operational standards, runbooks, and reliability practices
- Accountable for platform reliability outcomes at enterprise/product level
- Act as technical authority across platform, reliability, and delivery
- Define platform strategy and roadmap
- Govern delivery across internal teams and vendors
- Own SLIs, SLOs, and error budgets
- Lead resilience engineering, observability, and failure design
- Drive proactive risk reduction and continuous improvement
- Own incident management frameworks and continuous improvement
- Own end-to-end pipeline architecture and release automation
- Standardize, secure, and fully automate pipelines
- Drive continuous integration, delivery, and validation practices
- Lead Sev1 response, escalation, and recovery
- Own RCA and drive systemic fixes (not point fixes)
- Introduce AI-enabled pipeline optimization and quality gates
- Embed AI into monitoring, risk prediction, and CI/CD optimization
- Drive automation to reduce operational toil and improve decision-making
Requirements
What you’ll need- Bachelor’s degree in computer science, Engineering, or a related field.
- 6-10 years of hands-on experience in software operations, DevOps and Site Reliability Engineering, including managing large-scale, mission-critical systems.
- Clear and confident communication skills with ability to lead teams and collaborate effectively across engineering, product, and architecture teams.
- Proven track record ensuring high availability and performance in production environments, with expertise in fault-tolerant, distributed system design.
- Excellent understanding of modern software delivery pipelines and DevOps practices, including CI/CD, configuration management, and version control (Git).
- Exceptional problem-solving skills, with experience diagnosing complex system issues under pressure and driving them to resolution.
- Strong proficiency in at least one programming or scripting language (e.g., Python, Bash, or Java) for automation and tool integration.
- Self-driven and proactive, with a passion for automating manual processes and continuously improving systems to enhance reliability and team productivity.
Benefits
Comp & perks- Remote first / work from home culture
- Flexible vacation to help you rest, recharge, and connect with loved ones
- Paid leave benefits
- Health, dental, and vision insurance
- 401k retirement savings plan
- Infertility benefits
- Tuition reimbursement, life insurance, EAP – and more!
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
DevOpsSite Reliability EngineeringCI/CDconfiguration managementversion controlPythonBashJavaautomationperformance optimization
Soft Skills
leadershipcommunicationcollaborationproblem-solvingself-drivenproactiveaccountabilitycontinuous improvementmentoringrisk reduction