Minor Hotels Europe and Americas

RunOps / SRE Architect

full-time

Location Type: Office

Location: New York City • New York • 🇺🇸 United States

Salary

💰 $86,129 - $127,189 per year

Job Level

Mid-Level, Senior

Tech Stack

AWS, Cloud, EC2, Grafana, Prometheus, Terraform

About the role

  • Architect, implement, and maintain highly available, fault-tolerant, and DR-ready AWS environments, ensuring continuous business operations
  • Understanding of microservice architecture
  • Design and develop RunOps frameworks for operations, including monitoring, incident response, and automated recovery mechanisms
  • Automate provisioning and configuration using Terraform and CI/CD, ensuring consistent, secure, and auditable deployments
  • Build end-to-end CI/CD pipelines for multi-environment deployments and infrastructure delivery
  • Implement SRE best practices, focusing on reliability, scalability, and performance optimization across applications and infrastructure
  • Ensure data protection, encryption, and secure access control for S3, EC2, RDS, and other private AWS resources
  • Provide technical leadership in root cause analysis (RCA) and post-incident reviews, implementing automated remediation and self-healing solutions
  • Support BAU operations (L2/L3) through automation, operational tooling, and continuous improvement

Requirements

  • 6 years of hands-on experience in Cloud Operations / SRE / DevOps, with 5 years in AWS environments
  • Proven expertise in AWS services (EC2, S3, RDS, Lambda, VPC, CloudFront, CloudWatch, IAM, Config, Systems Manager)
  • Strong understanding of High Availability, Fault Tolerance, and Disaster Recovery architectures in AWS
  • Hands-on experience with Terraform for IaC and CI/CD automation
  • Experience with observability tools (New Relic, CloudWatch, X-Ray, OpenTelemetry, Prometheus, Grafana) and automation
  • Solid understanding of encryption, security best practices, and least-privilege access (S3 bucket policies, KMS, IAM roles)
  • Knowledge of DevOps and SRE principles (automation, reliability, incident management, and performance tuning)
  • Familiarity with AIOps and automated alert correlation is a plus
  • Excellent communication and collaboration skills to work with multidisciplinary teams and leadership

Benefits

  • Flexible work
  • Healthcare including dental, vision, mental health, and well-being programs
  • Financial well-being programs such as 401(k) and Employee Share Ownership Plan
  • Paid time off and paid holidays
  • Paid parental leave
  • Family building benefits like adoption assistance, surrogacy, and cryopreservation
  • Social well-being benefits like subsidized back-up child/elder care and tutoring
  • Mentoring, coaching and learning programs
  • Employee Resource Groups
  • Disaster Relief

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
AWS, Terraform, CI/CD, microservice architecture, SRE, High Availability, Fault Tolerance, Disaster Recovery, encryption, incident management
Soft skills
communication, collaboration, technical leadership