
Principal Site Reliability Engineer
Dayshape
full-time
Posted on:
Location Type: Hybrid
Location: Edinburgh • United Kingdom
Visit company websiteExplore more
Salary
💰 £74,000 - £85,000 per year
Job Level
About the role
- Support day-to-day production operations and BAU reliability work across our Azure environments
- Investigate and resolve incidents, service degradations, and operational issues in live systems
- Troubleshoot application and infrastructure issues across Azure compute, networking, and data services
- Improve Azure monitoring, alerting, and observability to reduce noise and improve detection
- Assist with root cause analysis and follow-up remediation actions
- Support environment management, configuration changes, and production deployments
- Help reduce operational toil through scripting, automation, and small tooling improvements
- Contribute to maintaining operational documentation, runbooks, and support knowledge
- Work with product and engineering teams to ensure services meet operational readiness standards
- Support ongoing reliability, performance, and stability improvements across the Azure platform
- Partner with engineering teams to support platform adoption, drive best practices and troubleshoot infrastructure issues.
- Mentor junior engineers and promote best practices in infrastructure automation and cloud architecture.
- Define comprehensive technical plans with the technical lead supporting wider team delivery.
- Contribute to team planning helping to align it with the strategic goals of the group.
Requirements
- Significant experience as a Site Reliability Engineer at a Principal level
- Proven experience operating and supporting production systems in Microsoft Azure
- Very confident troubleshooting live cloud systems and resolving operational issues
- Strong experience with Azure services such as App Services, VMs, networking, storage, and Azure SQL
- Worked closely with Azure monitoring and observability tools (e.g., Azure Monitor, Log Analytics, Application Insights)
- Comfortable setting technical direction with technical leads
- Experience with incident response and root cause analysis
- You are comfortable working across application, infrastructure, and cloud platform layers
- Pragmatic, delivery-focused, and able to pick up ongoing operational work quickly
- You have experience supporting .NET applications and Azure SQL Server
- Strong collaborator and communicate clearly with both engineers and stakeholders.
Benefits
- At least £1,000 per year to spend on professional and personal development
- 33 days' holiday per year (including bank holidays), increasing by 1 day each year to a maximum of 40 days
- Paid four week sabbatical in your fifth anniversary year on top of your holiday entitlement
- Enhanced family leave policies
- Private medical insurance, including dental and vision benefits
- Income protection and death in service cover
- Matched 5% auto-enrolment workplace pension scheme
- Access to wellbeing offerings, such as our Employee Assistance Programme and a dedicated counselling service
- Volunteering time – up to 20 hours a year to participate in volunteer work
- Regular All Hands meeting for inspiration and over-communication
- Time out of the working week for team socials each month, with a mix of in-person and virtual options: past events include hiking, family BBQs, board games and at-home cocktail classes!
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Site Reliability Engineertroubleshootingincident responseroot cause analysisAzure servicesAzure App ServicesAzure VMsAzure SQLinfrastructure automationcloud architecture
Soft Skills
collaborationcommunicationmentoringdelivery-focusedpragmatictechnical direction settingteam planningoperational work managementbest practices promotionsupport knowledge maintenance