RapidSOS

Senior Site Reliability Engineer

The Senior Site Reliability Engineer is responsible for the performance and reliability of the technology powering emergency response, working across infrastructure and application layers to improve system reliability and resilience.

Posted 4/21/2026 • Full-time • Remote • New York • 🇺🇸 United States • Senior • 💰 $160,000 - $195,000 per year

Tech Stack

Tools & technologies
AWS, Cloud, Distributed Systems, DNS, Kafka, Kubernetes, Python, RabbitMQ

About the role

Key responsibilities & impact
  • Own performance and reliability outcomes: Take ownership of how application-level decisions create system-level impact
  • Design for system resilience: Strengthen reliability through proactive design decisions
  • Build observability into system behavior: Proactively instrument services with structured logging
  • Own incidents from signal to resolution: Take ownership of production issues from first signal through resolution
  • Work across the stack without a permission slip: Work across infrastructure-as-code, container orchestration, CI/CD pipelines, and service-level application code

Requirements

What you’ll need
  • 5+ years of professional engineering experience with deep expertise in Python
  • Real cloud infrastructure experience with AWS: networking, managed databases, cost implications of traffic routing decisions, IAM, DNS-based routing and failover
  • Hands-on Kubernetes experience with containerized workloads in production across EKS, ECS, or Fargate
  • Strong understanding of distributed systems and how they fail
  • Experience operating high-throughput messaging systems (RabbitMQ, Kafka, AWS SNS / SQS, etc.)
  • Experience building or improving observability through logging, metrics, and alerting
  • Demonstrable experience using AI safely and securely to increase velocity and improve the reliability and recoverability of services
  • Strong proficiency in coding best practices: the ability to write clean, maintainable, and testable code
  • Demonstrated expertise in problem solving

Benefits

Comp & perks
  • Competitive salary, benefits, and equity participation
  • A dynamic, flexible and fun start-up work environment with a highly talented team

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Python, AWS, Kubernetes, EKS, ECS, Fargate, RabbitMQ, Kafka, AWS SNS, AWS SQS
Soft Skills
problem solving, ownership, proactive design, observability, reliability, communication