Salary
💰 £80,000 - £100,000 per year
About the role
- Own the operational performance of our Azure environments (Dev, QA, Stage, Prod)
- Ensure high availability, redundancy, and performance using Azure-native tools
- Manage integration runtimes and linked services
- Implement and maintain monitoring, alerting, and logging systems
- Drive automation across deployment pipelines and operational workflows
- Maintain and evolve infrastructure documentation and runbooks
- Enforce security best practices including MFA, time-based access controls, and role-based access
- Ensure compliance with ISO27001, SOC 2, and other relevant standards
- Manage responses to vendor security questionnaires and client-facing security documentation
- Partner with Engineering, Product, and Support teams to resolve incidents and improve platform reliability
- Support team scaling and rebalancing initiatives
- Lead incident response and root cause analysis for outages or performance issues
- Maintain SLA commitments and communicate service availability metrics
- Monitor and optimise Azure spend across environments
- Manage relationships with cloud service providers and ensure contract compliance
- Support the career development of the current team running and updating our service
- On-Call support rota management
- Weekly 1:1s with team members
- Hiring for team needs
Requirements
- Proven experience managing SaaS infrastructure at scale
- Experience with infrastructure-as-code, CI/CD pipelines, and cloud security
- Excellent communication and documentation skills
- Experience in team leadership or cross-functional collaboration
- Strong knowledge of Azure services: App Services, Data Factory, DevOps, Synapse, Defender (bonus)
- Experience with using Pulumi for deployment (bonus)