FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Senior Site Reliability Engineer
SecurityScorecardSenior Site Reliability Engineer optimizing and securing Kubernetes infrastructure at cybersecurity ratings leader. Collaborating with engineering teams to enhance delivery and production reliability.
Posted 6/9/2026full-timeAustin • Texas • 🇺🇸 United StatesSenior💰 $152,000 - $195,000 per yearWebsite
Tech Stack
Tools & technologiesCloudGoGrafanaJenkinsKafkaKubernetesPrometheusPythonTerraform
About the role
Key responsibilities & impact- Design, build, and scale Kubernetes infrastructure for secure, multi-tenant, high-availability applications.
- Build and operate AI tooling infrastructure — stand up MCP servers and establish secure, governed AI access and guardrails for production systems.
- Optimize and maintain CI/CD pipelines, improving reliability, speed, and rollback safety.
- Implement progressive delivery strategies such as blue/green and canary deployments.
- Advance Infrastructure as Code with Terraform, Helm, and Argo CD, defining reusable patterns for the org.
- Operate and optimize streaming and analytics infrastructure: Kafka, Flink, and ClickHouse.
- Build automated testing into the CI/CD lifecycle.
- Improve system observability — define SLOs, alerts, and dashboards.
- Lead incident response and postmortems, focusing on root cause and durable fixes.
- Mentor engineers across teams on Kubernetes, CI/CD, and cloud infrastructure.
Requirements
What you’ll need- 6+ years in SRE, DevOps, or Infrastructure roles, with significant production Kubernetes experience.
- Hands-on experience integrating AI/LLM tooling into engineering or operational workflows.
- Proven success building CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI, or similar).
- Strong with Kubernetes internals and managed services like EKS, GKE, or AKS.
- Expertise with Infrastructure as Code (Terraform, Helm, Pulumi) and GitOps.
- Proficient in Python, Bash, or Go.
- Knowledge of observability tooling (Prometheus, Grafana, Datadog, OpenTelemetry).
- Production experience with Kafka, Flink, and ClickHouse.
- Strong communication and cross-team collaboration skills.
Benefits
Comp & perks- Competitive salary
- Stock options
- Health benefits
- Unlimited PTO
- Parental leave
- Tuition reimbursements
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
KubernetesCI/CDInfrastructure as CodeTerraformHelmArgo CDPythonBashGoKafka
Soft Skills
communicationcross-team collaborationmentoringincident responsepostmortems