
Site Reliability Engineering Specialist
BT Group
full-time
Posted on:
Location Type: Office
Location: Birmingham • 🇬🇧 United Kingdom
Visit company websiteSalary
💰 £NaN per year
Job Level
Mid-LevelSenior
Tech Stack
AnsibleCloudKafkaLinuxPrometheusPythonTCP/IP
About the role
- Be a subject matter expert within the network engineering domain, accountable for the introduction of major end-to-end technology introduction and final line fault resolution
- Take the lead on complex and high impact fault resolution across multiple platforms and services
- Promote a culture of continuous improvement within a team of engineering professionals, contributing to recruitment and the resource strategy
- Use in-depth technical knowledge and operational understanding to identify and deliver improvements in service availability through end to end business ownership
- Implement flawless change, providing technical resolution and constantly learning, allowing resource flexibility across the unit to meet demand requirements & deliver “Brilliant” customer experience
- Champion and build effective working relationships, both internally and externally to deliver business outcomes, adding value throughout
- Be responsible for the change execution and assurance of Fixed Networks platforms managed by Professional Services
- Drive the publicity of the team through your contribution as a technically talented expert, challenging and supporting those around you to be accountable and perform at their best
- Contribute to improving reliability, scalability, and performance of services by driving automation, monitoring, and operational excellence across a number of environments
- Support the delivery of routine daily activities and be accountable for system design, build, testing, validation, maintenance, and ongoing support of all network infrastructure components
- Participate and be the technical lead on incidents within immediate domain and be part of PIR write up and solution
- Propose and implement innovations for continuous improvement within professional services
- Collaborate with design & platform teams and support the Implementation of flawless change into the live network, utilising automation and CI/CD pipelines
- Configure and maintain monitoring solutions such as ELK stack, Prometheus and Kafka to ensure the health and performance of our systems
- Be prepared to support a 365x24/7 callout, providing third line technical resolution covering an extensive range of technologies
Requirements
- Fully conversant and comply with Change Management processes and the supporting governance
- Confident and professional in communicating with all stakeholders, both locally and with members of the Senior Management Team
- Extensive experience of network technologies and protocols within your native domain
- Strong understanding of the E2E service journeys
- Ability to adapt to emerging and evolving technologies, in line with the Technology Career Map
- Ability to work in a high-pressure environment
- Self-driven attitude towards learning new skills and aiding the development of others
- Strong system administration skills, across an array of operating systems such as linux, windows etc. with a focus on performance optimisation, security, and automation
- Strong understanding cloud computing and best practices
- Proficiency in at least one programming language, such as Python or Ansible, as well as a familiarity with CI/CD orchestration
- Incident response to resolution
- Advocated and applied technology where it has not been used previously
- Delivered results when the deliverables were unclear
- Worked in a high pressured operational environment
- Experience with IP networks, with a strong understanding of various TCP/IP protocols
- Expertise in CI/CD pipelines and associated tools (e.g. GitLab CI)
- Proven experience in a fast-paced ISP setting, managing and troubleshooting large-scale networks
- Experience with Linux system administration to a high standard with a focus on performance optimisation, security, and automation
- Experience maintaining and deploying cloud computing infrastructure
- Strong experience developing workflow automation using programming languages such as Python or Ansible
- Experience in completing high impact changes in a live network and system environment
- Experience in learning new technologies and driving the development of others
Benefits
- Tailored training and development opportunities to continue to build your career
- 10% on target bonus
- 25 days’ annual leave (not including bank holidays), increasing with service
- Life Assurance
- Pension scheme - If you pay in a minimum of 5% of your pensionable salary every month we will pay in 10%
- Direct Share scheme
- Option to join the Healthcare Cash Plan or other benefits such as dental insurance, gym memberships etc.
- 50% off EE mobile pay monthly or SIM only plans
- Exclusive colleague discounts on our latest and greatest BT broadband packages BT TV, including TNT Sports and NOW entertainment
- Shared Parental leave - maximum amount of leave you can share with your partner is 50 weeks
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
network technologiesTCP/IP protocolsLinux system administrationcloud computingprogramming languagesPythonAnsibleCI/CD pipelinesautomationperformance optimisation
Soft skills
communicationleadershipadaptabilityself-drivenproblem-solvingteam collaborationstakeholder engagementcontinuous improvementhigh-pressure environmentmentoring