Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Berkeley Research Group (BRG)

Site Reliability Engineer

Berkeley Research Group (BRG)

Site Reliability Engineer designing, building, and maintaining highly available systems for health technology company. Collaborating with software developers to improve reliability and automate processes.

Posted 6/27/2026full-timeRemote • California • 🇺🇸 United StatesMid-LevelSenior💰 $130,000 - $160,000 per yearWebsite

Tech Stack

Tools & technologies
AWSAzureCloudGoGoogle Cloud PlatformKubernetesPythonRuby

About the role

Key responsibilities & impact
  • Design, implement, and maintain scalable and reliable systems in cloud environments such as Azure Cloud Services.
  • Provide operational support for full-stack software applications.
  • Increase system resilience with expert-level coding, bulletproof release, and change management skills.
  • Develop service-level indicators and objectives to automate release validation.
  • Improve automation and increase the system’s self-healing capability.
  • Collect operating system data and report performance metrics to stakeholders.
  • Ensure security best practices are followed in cloud infrastructure and application deployments.
  • Manage cloud and database system maintenance, debugging production issues as they arise.
  • Improve reliability, quality, and time-to-market of our suite of software solutions.
  • Partner with security and product teams to define and publish policies, processes, and playbooks to facilitate rapid and effective handling of alerts and incidents.
  • Lead incident management processes; respond to outages and service disruptions promptly.

Requirements

What you’ll need
  • Bachelor’s degree in computer science or similar field.
  • Five years’ experience as a site reliability engineer or similar role.
  • Strong programming skills (Golang, Ruby, Python, or similar).
  • Proven ability to diagnose and monitor performance and reliability issues across the stack.
  • Expertise in Kubernetes.
  • Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.
  • Proven experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).
  • Experience working with observability and incident management tools (Datadog, OpsGenie, PagerDuty).
  • Experience scripting operating system tasks with Infrastructure as Code.
  • Impeccable communication skills.
  • Ability to problem-solve in a fast-paced, high-stakes environment.
  • Candidate must be able to submit verification of his/her legal right to work in the United States, without company sponsorship.

Benefits

Comp & perks
  • 🌐 Worldwide ❌ Jobs You've Hidden ⭐️ Saved Jobs ✅ Applied Jobs ✉️ Email Alerts 👤 Account Berkeley Research Group (BRG) Website LinkedIn All Job Openings 1001 - 5000 employees 💰 Venture Round on 2020-07 Berkeley Research Group (BRG) is a global consulting firm that helps leading organizations advance in the fields of corporate finance; economics, disputes, and investigations; and performance improvement. With offices around the world, we are an integrated group of experts, industry leaders, academics, data scientists, and professionals working beyond borders and disciplines. We harness our collective expertise to deliver the inspired insights and practical strategies our clients need to stay ahead of what's next. Site Reliability Engineer 🔥 1 hour ago 🏄 California – Remote 💵 $130k - $160k / year ⏰ Full Time 🟡 Mid-level 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) AWS Azure Cloud Google Cloud Platform Kubernetes Python Ruby Go Apply Now Find Hiring Managers Customize resume + cover letter Report problem ☆ Save ☑️ Mark as applied ❌ Hide 📋 Description
  • Design, implement, and maintain scalable and reliable systems in cloud environments such as Azure Cloud Services.
  • Provide operational support for full-stack software applications.
  • Increase system resilience with expert-level coding, bulletproof release, and change management skills.
  • Develop service-level indicators and objectives to automate release validation.
  • Improve automation and increase the system’s self-healing capability.
  • Collect operating system data and report performance metrics to stakeholders.
  • Ensure security best practices are followed in cloud infrastructure and application deployments.
  • Manage cloud and database system maintenance, debugging production issues as they arise.
  • Improve reliability, quality, and time-to-market of our suite of software solutions.
  • Partner with security and product teams to define and publish policies, processes, and playbooks to facilitate rapid and effective handling of alerts and incidents.
  • Lead incident management processes; respond to outages and service disruptions promptly. 🎯 Requirements
  • Bachelor’s degree in computer science or similar field.
  • Five years’ experience as a site reliability engineer or similar role.
  • Strong programming skills (Golang, Ruby, Python, or similar).
  • Proven ability to diagnose and monitor performance and reliability issues across the stack.
  • Expertise in Kubernetes.
  • Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.
  • Proven experience working with cloud-native infrastructure (Azure Cloud Services, AWS, or GCP).
  • Experience working with observability and incident management tools (Datadog, OpsGenie, PagerDuty).
  • Experience scripting operating system tasks with Infrastructure as Code.
  • Impeccable communication skills.
  • Ability to problem-solve in a fast-paced, high-stakes environment.
  • Candidate must be able to submit verification of his/her legal right to work in the United States, without company sponsorship. Apply Now 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score Similar Jobs Cloud DevOps Engineer 🔥 3 hours ago CACI International Inc 10,000+ employees 🔒 Cybersecurity Website LinkedIn All Job Openings Cloud DevOps Engineer managing CI/CD pipelines and applications in AWS cloud. Collaborating on security initiatives and providing DevSecOps training with Agile teams. 🇺🇸 United States – Remote 💵 $82.1k - $172.4k / year ⏰ Full Time 🟡 Mid-level 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) Ansible AWS Cloud Docker EC2 Firewalls Grafana Java JavaScript Kubernetes OpenShift Prometheus Python SDLC Splunk Terraform Go Senior Site Reliability Engineer – Release 🔥 3 hours ago Alkami Technology 501 - 1000 🏦 Banking 💳 Fintech ☁️ SaaS Website LinkedIn All Job Openings Site Reliability Engineer at Alkami developing and testing code for application releases. Collaborating with teams to improve delivery and participate in on-call rotations. 🇺🇸 United States – Remote 💵 $110k - $137.5k / year 💰 $300M Post-IPO Debt - Alkami Technology on 2025-03 ⏰ Full Time 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) Ansible Docker Jenkins Kubernetes Postgres Python Redis Senior DevOps Engineer/Senior Associate Software Engineer 🔥 3 hours ago Amgen 10,000+ employees 🧬 Biotechnology 💊 Pharmaceuticals 🔬 Science Website LinkedIn All Job Openings DevOps Engineer responsible for designing, developing, and maintaining critical software applications for a biotech company. Collaborate with teams to deliver innovative solutions that impact patient care. 🇺🇸 United States – Remote 💵 $115.4k - $156.1k / year 💰 $28.5G Post-IPO Debt on 2022-12 ⏰ Full Time 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) 🦅 H1B Visa Sponsor AWS Cloud EC2 Jenkins Python DevSecOps Engineer 🔥 8 hours ago Oddball 51 - 200 🏛️ Government ☁️ SaaS 🤝 B2B Website LinkedIn All Job Openings DevSecOps Engineer supporting federal CMS BDAMAX program impacting Medicare experience for millions. Collaborating with teams to improve security practices and automate workflows. 🇺🇸 United States – Remote 💵 $100k - $140k / year ⏰ Full Time 🟡 Mid-level 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) AWS EC2 Jenkins Kubernetes Postgres Terraform DevOps Engineer 🔥 10 hours ago THEMIS Waste Recovery Technology 11 - 50 💳 Fintech 🏦 Banking 📋 Compliance Website LinkedIn All Job Openings DevOps Engineer managing cloud infrastructure and CI/CD pipelines for secure, reliable operations at Themis. Ensuring system health and performance while automating processes and maintaining security controls. 🇺🇸 United States – Remote ⏰ Full Time 🟡 Mid-level 🟠 Senior ⛑ DevOps & Site Reliability Engineer (SRE) AWS Azure Cloud Docker Google Cloud Platform Kubernetes Python Terraform Go View More DevOps Jobs 🌐 Worldwide Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com Search Search Jobs by country Search jobs by city Search jobs by job title Search entry-level jobs Search junior-level jobs Search senior-level jobs Search jobs by tech stack Search jobs by contract type Search remote internships Search remote part-time jobs Remote jobs Anywhere in the World Companies Hiring Anywhere in the World Companies Hiring Sales People Anywhere in the World Companies Hiring Software Engineers Anywhere in the World Resources Advice Tips for finding remote jobs Interview questions and answers Resume examples Cover letter examples Post a job Affiliates Privacy policy Terms of service Job board SEO course AI Apply Copilot OpenClaw job finder Jobs by Country Remote jobs anywhere in the world (Worldwide remote jobs) Remote jobs United States Remote jobs Australia Remote jobs Brazil Remote jobs Canada Remote jobs France Remote jobs Ireland Remote jobs Germany Remote jobs Netherlands Remote jobs Spain Remote jobs UK Popular Jobs Remote data analyst jobs Remote customer support jobs Remote executive assistant jobs Remote marketing jobs Remote product designer jobs Remote product manager jobs Remote project manager jobs Remote recruiter jobs Remote sales jobs Remote software engineer jobs Jobs by Type Remote full-time jobs Remote part-time jobs Remote contract jobs Remote internship jobs Remote entry-level jobs Remote jobs with no experience required Remote junior jobs (1-3 years of experience) Digital nomad jobs Remote jobs with no degree required Freelance remote jobs Temporary remote jobs Remote jobs hiring now Stay at home mom jobs

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
GolangRubyPythonKubernetesInfrastructure as Codecloud-native infrastructurerelease validationperformance monitoringsystem resilienceautomation
Soft Skills
communicationproblem-solvingincident managementcollaborationleadershipadaptabilitytime managementcritical thinkingattention to detailstakeholder engagement
Certifications
Bachelor's degree in computer scienceSite Reliability Engineering (SRE) Foundation certification