Proactively monitoring and managing our AWS/Cloud production environments and reacting swiftly to prevent or reduce customer visible impact
Escalation and communication of production issues to key stakeholders
Troubleshooting, reproducing, and mitigating complex system and infrastructure issues within the AWS environment
Incident management of high severity issues impacting our sites and services 24x7
Developing and implementing automation and tooling (e.g., leveraging CloudFormation, Ansible, or Terraform) in collaboration with the Site Reliability Engineering team to improve cloud management processes
Working on the engineering team backlog
Supporting service prior to go-live through pre-launch reviews
Providing technical support for internal products, requiring strong investigation, analysis, and resolution skills
Monitoring and checking of systems
Execution of daily system operations tasks, including maintenance and optimisation of the cloud infrastructure
Deployment, configuration, and management of cloud-native or updated solutions, utilizing Gitlab CICD
As a member of our Technology team, you will be working closely with the Development team and business units to deliver a high level of support and service
Requirements
Strong troubleshooting, problem-solving, and investigative skills applied across diverse operating systems and networked environments
Extensive experience operating and managing critical cloud infrastructure and production environments
Experience of working in an agile environment to deliver software
Knowledge and practical experience with scripting (including Shell scripting and Python) for automation and system management
Experience working in a Microsoft stack environment including Windows Server Operating system, Internet Information Services (IIS), Active Directory (AD) and database servers such as Microsoft SQL Server.
Operating knowledge of UNIX or Linux, including proficiency in Shell scripting and Python scripting for automated task scheduling and infrastructure management
Demonstrated experience with Amazon Web Services (AWS), specifically in managing core services (e.g., EC2, VPC, S3)
Sound knowledge of basic networking such as IPs, TCP/IP and Firewall
Proficient in quickly learning new technologies and ability to analyze business needs and recommend effective solutions
Excellent verbal and written communication skills
Benefits
Complimentary event tickets
Birthday and volunteering leave
Wellbeing discounts & flu vaccinations
Paid parental leave & free employee support (EAP)
Global rewards and recognition
Learning, development & career pathways
A diverse, inclusive, and passionate team
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.