Dayshape

Principal Site Reliability Engineer

Dayshape

full-time

Posted on:

Location Type: Hybrid

Location: EdinburghUnited Kingdom

Visit company website

Explore more

AI Apply
Apply

Salary

💰 £74,000 - £85,000 per year

Job Level

About the role

  • Support day-to-day production operations and BAU reliability work across our Azure environments
  • Investigate and resolve incidents, service degradations, and operational issues in live systems
  • Troubleshoot application and infrastructure issues across Azure compute, networking, and data services
  • Improve Azure monitoring, alerting, and observability to reduce noise and improve detection
  • Assist with root cause analysis and follow-up remediation actions
  • Support environment management, configuration changes, and production deployments
  • Help reduce operational toil through scripting, automation, and small tooling improvements
  • Contribute to maintaining operational documentation, runbooks, and support knowledge
  • Work with product and engineering teams to ensure services meet operational readiness standards
  • Support ongoing reliability, performance, and stability improvements across the Azure platform
  • Partner with engineering teams to support platform adoption, drive best practices and troubleshoot infrastructure issues.
  • Mentor junior engineers and promote best practices in infrastructure automation and cloud architecture.
  • Define comprehensive technical plans with the technical lead supporting wider team delivery.
  • Contribute to team planning helping to align it with the strategic goals of the group.

Requirements

  • Significant experience as a Site Reliability Engineer at a Principal level
  • Proven experience operating and supporting production systems in Microsoft Azure
  • Very confident troubleshooting live cloud systems and resolving operational issues
  • Strong experience with Azure services such as App Services, VMs, networking, storage, and Azure SQL
  • Worked closely with Azure monitoring and observability tools (e.g., Azure Monitor, Log Analytics, Application Insights)
  • Comfortable setting technical direction with technical leads
  • Experience with incident response and root cause analysis
  • You are comfortable working across application, infrastructure, and cloud platform layers
  • Pragmatic, delivery-focused, and able to pick up ongoing operational work quickly
  • You have experience supporting .NET applications and Azure SQL Server
  • Strong collaborator and communicate clearly with both engineers and stakeholders.
Benefits
  • At least £1,000 per year to spend on professional and personal development
  • 33 days' holiday per year (including bank holidays), increasing by 1 day each year to a maximum of 40 days
  • Paid four week sabbatical in your fifth anniversary year on top of your holiday entitlement
  • Enhanced family leave policies
  • Private medical insurance, including dental and vision benefits
  • Income protection and death in service cover
  • Matched 5% auto-enrolment workplace pension scheme
  • Access to wellbeing offerings, such as our Employee Assistance Programme and a dedicated counselling service
  • Volunteering time – up to 20 hours a year to participate in volunteer work
  • Regular All Hands meeting for inspiration and over-communication
  • Time out of the working week for team socials each month, with a mix of in-person and virtual options: past events include hiking, family BBQs, board games and at-home cocktail classes!
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Site Reliability Engineertroubleshootingincident responseroot cause analysisAzure servicesAzure App ServicesAzure VMsAzure SQLinfrastructure automationcloud architecture
Soft Skills
collaborationcommunicationmentoringdelivery-focusedpragmatictechnical direction settingteam planningoperational work managementbest practices promotionsupport knowledge maintenance