FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Site Reliability Engineer
VeriskSite Reliability Engineer responsible for designing and operating multi-region architectures in production services. Collaborating with a small engineering team to ensure high availability across systems.
Tech Stack
Tools & technologiesAWSCloudDistributed SystemsDockerKubernetesSplunk
About the role
Key responsibilities & impact- Design and operate multi-region architectures (active/active or active/passive)
- Implement and improve automated failover and traffic routing
- Identify and eliminate single points of failure
- Ensure regional isolation and graceful degradation when dependencies fail
- Define realistic availability goals and failure scenarios
- Help the team understand RTO/RPO trade-offs
- Build and maintain clear, actionable observability (metrics, logs, traces)
- Create alerts that detect real problems without noise
- Participate in on-call and help improve incident response
- Reduce manual operational work through automation
- Monitor performance and saturation across regions
Requirements
What you’ll need- Experience operating production systems with real availability requirements
- Hands-on experience with cloud infrastructure and distributed systems
- Strong understanding of high availability patterns
- Comfortable being hands-on: debugging, automating, improving systems
- Pragmatic mindset — you know when simple is better than perfect
- Clear communicator who works well in a small, collaborative team
- Strong expertise in Amazon Web Services (multi-region architecture)
- Experience designing Active-Active / Active-Passive deployments
- Advanced knowledge of VPC networking, IAM, Route 53, Load Balancers, and EKS
- Advanced experience with Kubernetes (EKS preferred)
- Strong knowledge of Docker
- Advanced knowledge of Splunk
- Strong expertise in Dynatrace
Benefits
Comp & perks- Health insurance
- Flexible work arrangements
- Paid time off
- Professional development
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
cloud infrastructuredistributed systemshigh availability patternsautomationAmazon Web ServicesActive-Active deploymentsActive-Passive deploymentsVPC networkingKubernetesDocker
Soft Skills
clear communicatorcollaborative team playerpragmatic mindsetdebuggingincident response