Runlayer

Senior Site Reliability Engineer

Runlayer

full-time

Posted on:

Location Type: Remote

Location: United States

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Own reliability and performance of our cloud infrastructure across AWS (ECS, Aurora, CloudWatch) and GCP
  • Manage and optimize Kubernetes clusters and container orchestration
  • Drive database reliability engineering, including performance tuning and scaling
  • Build and maintain CI/CD pipelines for rapid, safe deployments
  • Run incident response and on-call rotations
  • Partner with product engineers to design scalable, resilient systems

Requirements

  • Strong AWS experience, particularly ECS, Aurora, and CloudWatch
  • GCP experience as we expand cross-cloud
  • Kubernetes and container orchestration expertise
  • DBRE experience with database performance tuning
  • CI/CD pipeline ownership and incident response experience
  • Background at a B2B SaaS company serving enterprise customers, ideally in infrastructure
  • Bonus Qualifications: Experience deploying and supporting on-prem or hybrid environments, Python backend familiarity (our platform is Python-based), Experience at an early-stage or high-growth company
Benefits
  • Competitive salary and equity — compensation that reflects your expertise and customer-facing responsibilities.
  • Paid time off — 4 weeks paid vacation, paid sick leave, and paid parental leave.
  • Professional development — budget for conferences, courses, and certifications in AI, enterprise software, and customer success.
  • Top-tier equipment — your choice of laptop and accessories to create your ideal work environment.
  • Health benefits — comprehensive health, dental, and vision coverage.
  • Customer interaction opportunities — work directly with innovative companies and see the immediate impact of your work.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AWSECSAuroraCloudWatchGCPKubernetescontainer orchestrationdatabase performance tuningCI/CD pipelinesPython
Soft Skills
incident responseon-call rotationscollaboration