
Site Reliability Engineer
Minor Hotels Europe and Americas
full-time
Posted on:
Location Type: Office
Location: Hyderabad • India
Visit company websiteExplore more
Tech Stack
About the role
- Ensures reliability, scalability, and performance of systems across any cloud platform
- Focuses on observability, incident management, and automation
- Manage infrastructure on AWS, Azure, or GCP
- Implement IaC using Terraform and Deployment Manager
- Build CI/CD pipelines using GitHub Actions and other tools
- Containerize and orchestrate workloads using Docker and Kubernetes
- Automate tasks using Linux, Bash, and Python scripting
- Monitor systems using Kibana, Splunk, New Relic, and APM tools like Grafana, Dynatrace, AppDynamics, Datadog
- Define and manage SLIs/SLOs, error budgets
- Work with ITSM/ITIL processes, ticketing tools like JIRA, ServiceNow
- Handle production support, release management, and chaos engineering
Requirements
- Multi-cloud experience (AWS, Azure, GCP)
- Terraform and Deployment Manager
- GitHub Actions and CI/CD tools
- Docker and Kubernetes
- Linux, Bash, Python scripting
- Monitoring and APM tools (Grafana, Dynatrace, Datadog, etc.)
- SLI/SLOs, error budgets, ITSM/ITIL, ticketing tools
Benefits
- comprehensive wellness benefits including health checks
- telemedicine
- insurance with top-ups
- elder care
- partner coverage or new parent support via flexible work
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
IaCTerraformDeployment ManagerCI/CDGitHub ActionsDockerKubernetesLinuxBashPython
Soft Skills
incident managementautomationproduction supportrelease managementchaos engineering