Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Aya Healthcare

Manager, Site Reliability Engineering

Aya Healthcare

Lead the SRE team at Aya Healthcare for enhancing product reliability and operational efficiency. Manage incident responses and AI-native operations for a top healthcare workforce solutions provider.

Posted 6/9/2026full-timeRemote • 🇺🇸 United StatesSeniorLead💰 $230,000 - $255,000 per yearWebsite

Tech Stack

Tools & technologies
AWSAzureGoogle Cloud Platform

About the role

Key responsibilities & impact
  • Lead and grow the SRE team
  • Drive reliability, performance, and availability
  • Operational intelligence and AI-native operations
  • Platform efficiency and stakeholder trust

Requirements

What you’ll need
  • 10+ years in a combination of Site Reliability Engineering, DevOps, Platform Engineering, or related production-operations roles.
  • 4+ years of direct people management experience — hiring, performance management, career development, and running remote on-call teams.
  • Demonstrated ownership of reliability outcomes for customer-facing SaaS at meaningful scale — defining and operationalizing SLOs/SLIs/error budgets and using them to drive engineering prioritization.
  • Deep Azure experience — 3+ years operating production workloads on Azure, with hands-on depth in AKS, networking, identity, and platform services. Equivalent depth in AWS or GCP will be considered.
  • Modern observability fluency — production-grade experience with Datadog (or equivalent: New Relic, Dynatrace, AppDynamics) across metrics, logs, traces, RUM, and synthetics.
  • AI in operations — hands-on experience integrating AI/LLM-assisted tooling into operational workflows (incident summarization, runbook generation, log analysis, anomaly triage, change risk scoring).
  • Incident command experience — proven ability to lead severity-1 incidents end-to-end, run blameless reviews, and convert lessons into systemic improvements.
  • Regulated-environment instinct — operates with HIPAA, PHI, SOC 2, or comparable compliance constraints as a default mindset, not an afterthought.
  • Executive-grade communication — translates reliability work into business outcomes for executive, product, and customer-facing audiences.
  • Bachelor's degree in Computer Science, Information Technology, Engineering, or related field — or an equivalent combination of education, training, and experience.

Benefits

Comp & perks
  • Free premium medical, dental, life and vision insurance
  • Generous 401(k) match
  • Aya also offers other benefits to those that are eligible and where required by applicable law, including reimbursements and discretionary bonuses
  • Aya provides paid sick leave in accordance with all applicable state, federal, and local laws. Aya’s general sick leave policy is that employees accrue one hour of paid sick leave for every 30 hours worked. However, to the extent any provisions of the statement above conflict with any applicable paid sick leave laws, the applicable paid sick leave laws are controlling
  • Celebrations! We hit our goals and reward ourselves.
  • Company-sponsored virtual events, happy hours and team-building activities are always on the horizon — plus, you get a special treat on your birthday!
  • Unlimited DTO — we believe in time off!
  • Virtual yoga, meditation or boot camp classes offered daily

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Site Reliability EngineeringDevOpsPlatform EngineeringSLOsSLIserror budgetsAzureAKSobservabilityAI in operations
Soft Skills
people managementperformance managementcareer developmentexecutive-grade communicationincident commandblameless reviewsstakeholder trustoperational intelligence