Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
qode.world

Senior Site Reliability Engineer

qode.world

. Design and implement unified observability dashboards across metrics, logs, traces, events, and topology .

Posted 4/21/2026full-timeTexas • South Carolina, Texas • 🇺🇸 United StatesSeniorWebsite

Tech Stack

Tools & technologies
Distributed SystemsKafka

About the role

Key responsibilities & impact
  • Design and implement unified observability dashboards across metrics, logs, traces, events, and topology
  • Define and manage SLIs, SLOs, and error budgets aligned to business outcomes
  • Build actionable dashboards for operations, engineering, and leadership
  • Implement alerting strategies using static and dynamic thresholds
  • Leverage AI/ML/AIOps to detect anomalies, predict incidents, and reduce MTTR
  • Transition monitoring from reactive alerts to proactive insights
  • Implement noise reduction, alert correlation, and root cause analysis
  • Apply baseline modeling, seasonality detection, and anomaly scoring
  • Monitor and troubleshoot multi-service architectures
  • Identify whether issues originate from upstream/downstream dependencies, streaming platform, infrastructure, or application code
  • Deep hands-on experience with Dynatrace (mandatory)

Requirements

What you’ll need
  • 15+ years in SRE / Production Engineering
  • Strong Unified Observability background (not infra-only)
  • Hands-on Dynatrace experience (metrics, traces, logs, Davis AI)
  • SLI/SLO engineering experience in production systems
  • Experience implementing dynamic thresholds and anomaly detection
  • Knowledge of AI/ML concepts applied to Ops (AIOps)
  • Distributed systems troubleshooting expertise
  • Experience with Kafka or streaming data platforms

Benefits

Comp & perks
  • 🌐 Worldwide ❌ Jobs You've Hidden ⭐️ Saved Jobs ✅ Applied Jobs ✉️ Email Alerts 👤 Account qode.world Website LinkedIn All Job Openings 11 - 50 employees 🤖 Artificial Intelligence 👥 HR Tech 🎯 Recruiter Artificial Intelligence
  • HR Tech
  • Recruitment qode. world is a company that leverages artificial intelligence to revolutionize the recruiting process. Their platform allows users to find candidates by sourcing data from billions of data points worldwide and provides data-driven insights. Users can connect with candidates directly through the platform, conduct customized AI-led interviews, and get comprehensive assessments. The service also integrates easily with LinkedIn, enhancing the talent pool and facilitating direct communication with candidates listed there. Qode. world offers additional recruiting services to assist in hiring for niche or senior roles. They are praised for their effectiveness in streamlining the hiring process and delivering quick results. Senior Site Reliability Engineer 🔥 54 minutes ago 🏢🏡 Texas – Hybrid ⏰ Full Time 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) Distributed Systems Kafka Apply Now Find Hiring Managers Customize resume for this job Report problem ☆ Save ☑️ Mark as applied ❌ Hide 📋 Description
  • Design and implement unified observability dashboards across metrics, logs, traces, events, and topology
  • Define and manage SLIs, SLOs, and error budgets aligned to business outcomes
  • Build actionable dashboards for operations, engineering, and leadership
  • Implement alerting strategies using static and dynamic thresholds
  • Leverage AI/ML/AIOps to detect anomalies, predict incidents, and reduce MTTR
  • Transition monitoring from reactive alerts to proactive insights
  • Implement noise reduction, alert correlation, and root cause analysis
  • Apply baseline modeling, seasonality detection, and anomaly scoring
  • Monitor and troubleshoot multi-service architectures
  • Identify whether issues originate from upstream/downstream dependencies, streaming platform, infrastructure, or application code
  • Deep hands-on experience with Dynatrace (mandatory) 🎯 Requirements
  • 15+ years in SRE / Production Engineering
  • Strong Unified Observability background (not infra-only)
  • Hands-on Dynatrace experience (metrics, traces, logs, Davis AI)
  • SLI/SLO engineering experience in production systems
  • Experience implementing dynamic thresholds and anomaly detection
  • Knowledge of AI/ML concepts applied to Ops (AIOps)
  • Distributed systems troubleshooting expertise
  • Experience with Kafka or streaming data platforms Apply Now 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score 🌐 Worldwide Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com Search Search Jobs by country Search jobs by city Search jobs by job title Search entry-level jobs Search junior-level jobs Search senior-level jobs Search jobs by tech stack Search jobs by contract type Search remote internships Search remote part-time jobs Remote jobs Anywhere in the World Companies Hiring Anywhere in the World Companies Hiring Sales People Anywhere in the World Companies Hiring Software Engineers Anywhere in the World Resources Advice Tips for finding remote jobs Interview questions and answers Resume examples Cover letter examples Post a job Affiliates Privacy policy Terms of service Job board SEO course AI Apply Copilot OpenClaw job finder Jobs by Country Remote jobs anywhere in the world (Worldwide remote jobs) Remote jobs United States Remote jobs Australia Remote jobs Brazil Remote jobs Canada Remote jobs France Remote jobs Ireland Remote jobs Germany Remote jobs Netherlands Remote jobs Spain Remote jobs UK Popular Jobs Remote data analyst jobs Remote customer support jobs Remote executive assistant jobs Remote marketing jobs Remote product designer jobs Remote product manager jobs Remote project manager jobs Remote recruiter jobs Remote sales jobs Remote software engineer jobs Jobs by Type Remote full-time jobs Remote part-time jobs Remote contract jobs Remote internship jobs Remote entry-level jobs Remote jobs with no experience required Remote junior jobs (1-3 years of experience) Digital nomad jobs Remote jobs with no degree required Freelance remote jobs Temporary remote jobs Remote jobs hiring now Stay at home mom jobs

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
SLISLOerror budgetsalerting strategiesanomaly detectionbaseline modelingseasonality detectionroot cause analysismulti-service architecturesdistributed systems troubleshooting
Soft Skills
leadershipcommunication