BT Group

Site Reliability Engineering Specialist

BT Group

full-time

Posted on:

Location Type: Office

Location: Bengaluru • 🇮🇳 India

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

AWSAzureCloudGoogle Cloud PlatformJavaMicroservicesPythonSDLC

About the role

  • Executes the implementation of new software development life cycle automation tools, frameworks, and code pipelines
  • Coordinates a diverse team and creates the initial test schedule to deliver all aspects of testing
  • Executes the implementation of automation technologies to ensure repeatability
  • Proactively identifies and manages risk through regular assessment
  • Leads scale testing to measure, tune and optimize system performance
  • Executes metric/monitoring analysis that creates stability, security, and performance improvements
  • Designs, analyses, develops and troubleshoots highly distributed large-scale production systems
  • Executes approaches that scale systems sustainably through mechanisms like automation
  • Writes and delivers infrastructure as code software to improve availability and efficiency
  • Implements robust monitoring and alerting systems and performs root cause analysis
  • Inspects queue and support processing to ensure early warning of support issues
  • Executes retrospective and preventive actions after each high severity production incident
  • Analyses complex systems from a reliability and resilience perspective and identifies sources of instability
  • Champions and shares knowledge on emerging trends

Requirements

  • A degree in IT, Maths or Science
  • A deep understanding of full stack monitoring solutions such as Dynatrace
  • Strong proficiency in one or more programming languages (e.g. Java, Python)
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Solid understanding of software architecture, design patterns, and microservices
  • Familiarity with CI/CD tools and DevOps practices
  • High levels of quality presentation and reporting capabilities
  • Resilience to ensure support teams are engaged 24x7x365
  • Ability to adapt to latest industry trends
  • CI/CD/CT Pipeline management
  • Micro-Service functionality
  • Business Process Improvement
  • Growth mindset AI driven Observability & AIOps
  • AIOps fundamentals (cross domain telemetry ingestion, event correlation)
  • Agentic/autonomous observability skills (using intelligent agents)
  • AI assisted alerting & noise reduction
Benefits
  • Flexible work arrangements
  • Professional development

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
JavaPythonCI/CDDevOpsAIOpsInfrastructure as CodeMonitoring SolutionsMicroservicesCloud PlatformsSoftware Architecture
Soft skills
LeadershipRisk ManagementAdaptabilityPresentation SkillsReporting SkillsResilienceTeam CoordinationKnowledge SharingProblem SolvingAnalytical Thinking