Commify

Site Reliability Engineer

full-time

Origin: 🇬🇧 United Kingdom

Salary

💰 £75,000 - £90,000 per year

Job Level

Mid-Level, Senior

Tech Stack

Ansible, Azure, Chef, Cloud, DNS, Grafana, Jenkins, Kubernetes, Puppet, Python, Ruby, Terraform

About the role

  • Ensure products and platforms perform optimally by understanding how software interacts with physical and cloud infrastructure
  • Maintain high levels of system performance through monitoring and performance tuning
  • Implement scalability and fault tolerance
  • Automate processes and improve operational efficiencies
  • Troubleshoot application and middleware challenges
  • Collaborate with engineering teams to support high-throughput production environments
  • Build and maintain robust deployment pipelines

Requirements

  • Proficiency with Microsoft Azure
  • Strong expertise in Terraform, App Services, and Kubernetes
  • Fluent in both written and spoken English
  • Passion for reliability in systems
  • Experience in creating and modifying Terraform deployments
  • Prior experience in an operations role, ideally as a Site Reliability Engineer
  • Ability to work cross-functionally, take ownership, and prioritise effectively
  • Excellent communication and collaboration skills
  • Experience with monitoring solutions (e.g., Datadog, Azure Application Insights, Log Analytics)
  • Programming/scripting skills for automation (PowerShell preferred; Bash, C#, Ruby, or Python also acceptable)
  • Experience with web-based applications
  • (Desirable) Familiarity with Azure DevOps pipelines
  • (Desirable) Experience with Microsoft Server Operating Systems
  • (Desirable) Understanding of service level objectives and operational requirements for cloud-based solutions
  • (Desirable) Comprehensive knowledge of Microsoft Azure Cloud offerings (especially in PaaS)
  • (Desirable) Experience with tools such as Terraform, Ansible, VSTS, ARM, Puppet, Chef, Jenkins, ELK, Grafana
  • (Desirable) Understanding of DNS, Load Balancer configuration, Active Directory, and cloud network infrastructure
  • (Desirable) Experience in agile environments and methodologies (TDD, Scrum, Kanban)
  • (Desirable) Knowledge of monitoring and alerting systems for microservice architectures
  • (Desirable) Applied knowledge of cloud security best practices