Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
RapidSOS

Senior Site Reliability Engineer

RapidSOS

. Own performance and reliability outcomes: Ownership of how application-level decisions create system-level impact .

Posted 4/21/2026full-timeRemote • New York • 🇺🇸 United StatesSenior💰 $160,000 - $195,000 per yearWebsite

Tech Stack

Tools & technologies
AWSCloudDistributed SystemsDNSKafkaKubernetesPythonRabbitMQ

About the role

Key responsibilities & impact
  • Own performance and reliability outcomes: Ownership of how application-level decisions create system-level impact
  • Design for system resilience: Responsibility for strengthening reliability through proactive design decisions
  • Build observability into system behavior: Proactively instrument services with structured logging
  • Own incidents from signal to resolution: Ownership of production issues from first signal through resolution
  • Work across the stack without a permission slip: You’ll work across infrastructure-as-code, container orchestration, CI/CD pipelines, and service-level application code

Requirements

What you’ll need
  • 5+ years of professional engineering experience with deep expertise in Python
  • Real cloud infrastructure experience with AWS: networking, managed databases, cost implications of traffic routing decisions, IAM, DNS-based routing and failover
  • Hands-on kubernetes experience with containerized workloads in production across EKS, ECS, or Fargate
  • Strong understanding of distributed systems and how they fail
  • Experience operating high-throughput messaging systems (RabbitMQ, Kafka, AWS SNS / SQS, etc.)
  • Experience building or improving observability through logging, metrics, and alerting
  • Demonstrable experience in using AI to safely and securely enhance velocity, improve reliability and recoverability of services
  • Strong proficiency in coding best practices – ability to write clean, maintainable, and testable code
  • Demonstrated expertise in problem solving

Benefits

Comp & perks
  • Competitive salary and benefits and equity participation
  • A dynamic, flexible and fun start-up work environment with a highly talented team

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonAWSKubernetesEKSECSFargateRabbitMQKafkaAWS SNSAWS SQS
Soft Skills
problem solvingownershipproactive designobservabilityreliabilitycommunication