Site Reliability Engineer
OneStream Software
Site Reliability Engineer managing scalable cloud services in a remote environment. Collaborating with teams to ensure reliability and performance in infrastructure deployments.
Posted 6/6/2026 • Full-time • Remote • 🇺🇸 United States • Mid-Level, Senior • 💰 $114,000 - $148,000 per year
Tech Stack
Tools & technologies
Ansible, AWS, Azure, Chef, Cloud, Google Cloud Platform, Grafana, Kubernetes, OpenShift, Prometheus, Puppet, Python, Terraform
About the role
Key responsibilities & impact
- Implement application/infrastructure observability solutions to ensure desired application availability, reliability, and performance
- Participate in regular On-Call rotations and share details related to incidents and their resolution through post-mortem reports and regular review meetings
- Proactively partner with Product and Engineering teams to identify, develop, deploy, and maintain reliable systems and services
- Influence and create new designs, architectures, standards, and methods for large-scale systems
- Sustain a high level of reliability for key services and automated systems
- Automate processes to improve reliability, performance, and availability
- Update technical documentation, workflows, and knowledge base articles
- Provide feedback in pull requests and peer coding reviews
- Implement codified automated solutions that build integrations between Dynatrace, Azure DevOps and Jira
- Solid knowledge in focused areas of OneStream Software
- Ability to mentor others in several technical areas
- Understanding of the practical use of SOC/FedRAMP controls to assist Compliance and Security teams
Requirements
What you’ll need
- BS/BA in computer science, engineering, or technology-related field (or equivalent work experience)
- Proven work experience as a Site Reliability Engineer or in a similar role
- 6+ years of cloud infrastructure and software development experience
- 2+ years of hands-on experience with Azure Kubernetes Service (AKS) and container-based deployments, or with other platforms such as OpenShift, GKE, or EKS
- Advanced understanding of APM and observability tools such as Dynatrace, AppInsights, DataDog, Log Analytics, New Relic, Prometheus and Grafana
- Advanced understanding of Infrastructure-as-Code (IaC) concepts and tooling (Terraform, CloudFormation templates, Bicep or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP)
- Deep knowledge of Configuration Management/Orchestration utilities such as Ansible, PowerShell DSC, Chef, and Puppet
- Advanced understanding of cloud concepts including elasticity, security, and identity management
- Familiarity with Agile development methodologies using Jira or Azure DevOps Boards
- 6+ years of hands-on experience with the following technologies, tools, and concepts: automating processes using PowerShell, Bash, CLI, REST APIs, Python, ARM templates, or other scripting languages
- Comfortable leveraging source control tools such as Git, Azure DevOps, or GitHub
- Knowledge of container orchestration platforms and tooling such as Kubernetes, OpenShift, AKS, GKE, or Helm
- Microsoft Azure, Amazon Web Services (AWS) or Google Cloud (GCP)
Benefits
Comp & perks
- Vision
- Medical
- Life
- Dental
- 401K
ATS Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Site Reliability Engineering, cloud infrastructure, software development, Azure Kubernetes Services (AKS), Infrastructure-as-Code (IaC), Terraform, Configuration Management, Ansible, PowerShell, Python
Soft Skills
mentoring, collaboration, problem-solving, communication, influence, feedback, documentation, automation mindset, proactive approach, incident resolution
Certifications
BS/BA in computer science, engineering degree, technology-related field