OneStream Software

Site Reliability Engineer

Site Reliability Engineer role managing scalable cloud services in a remote environment and collaborating with teams to ensure reliability and performance in infrastructure deployments.

Posted 6/6/2026 • Full-time • Remote • 🇺🇸 United States • Mid-Level, Senior • 💰 $114,000 - $148,000 per year

Tech Stack

Tools & technologies
Ansible, AWS, Azure, Chef, Cloud, Google Cloud Platform, Grafana, Kubernetes, OpenShift, Prometheus, Puppet, Python, Terraform

About the role

Key responsibilities & impact
  • Implement application/infrastructure observability solutions to ensure desired application availability, reliability, and performance
  • Participate in regular on-call rotations and share incident details and resolutions through post-mortem reports and regular review meetings
  • Proactively partner with Product and Engineering teams to identify, develop, deploy, and maintain reliable systems and services
  • Influence and create new designs, architectures, standards, and methods for large-scale systems
  • Sustain a high level of reliability for key services and automated systems
  • Automate processes to improve reliability, performance, and availability
  • Update technical documentation, workflows, and knowledge base articles
  • Provide feedback in pull requests and peer coding reviews
  • Implement codified automated solutions that build integrations between Dynatrace, Azure DevOps and Jira
  • Maintain solid knowledge of focused areas of OneStream Software
  • Mentor others in several technical areas
  • Understand the practical use of SOC/FedRAMP controls to assist Compliance and Security teams

Requirements

What you’ll need
  • BS/BA in computer science, engineering, or technology-related field (or equivalent work experience)
  • Proven work experience as a Site Reliability Engineer or in a similar role
  • 6+ years of cloud infrastructure and software development experience
  • 2+ years of hands-on experience with Azure Kubernetes Service (AKS), including container-based deployment skills, or with other platforms such as OpenShift, GKE, or EKS
  • Advanced understanding of APM and observability tools such as Dynatrace, AppInsights, Datadog, Log Analytics, New Relic, Prometheus, and Grafana
  • Advanced understanding of Infrastructure-as-Code (IaC) concepts and tooling (Terraform, CloudFormation templates, Bicep or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP)
  • Deep knowledge of Configuration Management/Orchestration utilities such as Ansible, PowerShell DSC, Chef, and Puppet
  • Advanced understanding of cloud concepts including elasticity, security, and identity management
  • Well versed in Agile development methodologies using Jira or Azure DevOps Boards
  • 6+ years of hands-on experience with the following technologies, tools, and concepts:
      • Automating processes using PowerShell, Bash, CLI, REST APIs, Python, ARM templates, or other scripting languages
      • Comfortable leveraging source control tools such as Git, Azure DevOps, or GitHub
      • Knowledge of container orchestration platforms such as Kubernetes, OpenShift, AKS, GKE, or Helm
      • Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP)

Benefits

Comp & perks
  • Vision
  • Medical
  • Life
  • Dental
  • 401(k)
