FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Senior Site Reliability Engineer
RapidSOSSenior Site Reliability Engineer responsible for performance and reliability of technology powering emergency response. Working across infrastructure and application layers to enhance system reliability and resilience.
Posted 4/21/2026full-timeRemote • New York • 🇺🇸 United StatesSenior💰 $160,000 - $195,000 per yearWebsite
Tech Stack
Tools & technologiesAWSCloudDistributed SystemsDNSKafkaKubernetesPythonRabbitMQ
About the role
Key responsibilities & impact- Own performance and reliability outcomes: Ownership of how application-level decisions create system-level impact
- Design for system resilience: Responsibility for strengthening reliability through proactive design decisions
- Build observability into system behavior: Proactively instrument services with structured logging
- Own incidents from signal to resolution: Ownership of production issues from first signal through resolution
- Work across the stack without a permission slip: You’ll work across infrastructure-as-code, container orchestration, CI/CD pipelines, and service-level application code
Requirements
What you’ll need- 5+ years of professional engineering experience with deep expertise in Python
- Real cloud infrastructure experience with AWS: networking, managed databases, cost implications of traffic routing decisions, IAM, DNS-based routing and failover
- Hands-on kubernetes experience with containerized workloads in production across EKS, ECS, or Fargate
- Strong understanding of distributed systems and how they fail
- Experience operating high-throughput messaging systems (RabbitMQ, Kafka, AWS SNS / SQS, etc.)
- Experience building or improving observability through logging, metrics, and alerting
- Demonstrable experience in using AI to safely and securely enhance velocity, improve reliability and recoverability of services
- Strong proficiency in coding best practices – ability to write clean, maintainable, and testable code
- Demonstrated expertise in problem solving
Benefits
Comp & perks- Competitive salary and benefits and equity participation
- A dynamic, flexible and fun start-up work environment with a highly talented team
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonAWSKubernetesEKSECSFargateRabbitMQKafkaAWS SNSAWS SQS
Soft Skills
problem solvingownershipproactive designobservabilityreliabilitycommunication