
Site Reliability Engineer
BAE Systems Digital Intelligence
full-time
Posted on:
Location Type: Hybrid
Location: Gloucester • United Kingdom
Visit company websiteExplore more
About the role
- Supporting and maintaining essential service that support core mission applications
- Proactively enhancing their availability, performance and stability
- Finding innovative solutions to problems rather than undertaking repetitive work, automating everything you can
- Advising product teams of good practice in how to design and build systems
- Instrumenting applications where they don’t have sufficient monitoring in place
- Participating in the wider DevOps/SRE community within the organisation
Requirements
- Experience in software development in Java and web technologies, e.g. JavaScript and HTML
- Familiarisation with database technologies such as Elastic, Mongo
- Knowledge of Linux and Windows command lines, e.g. Bash and PowerShell
- Hands-on experience with cloud infrastructure such as AWS, Azure or OpenStack
- Use of deployment tools such as chef and puppet
- Expertise in monitoring large systems using technologies such as ELK
- Experience of working in an Agile scrum team, and the tooling that supports it, e.g. Jira
- Diagnosing and troubleshooting application issues resulting in service outages
- Troubleshooting skills across different levels of the stack
- Understanding of ITIL terminology
- Experience with container management and micro-services architectures such as Docker
- Familiarisation with automation test frameworks such as Selenium
- Awareness and insight into technology trends to adopt new cutting edge tools
Benefits
- Hybrid and flexible working arrangements
- Community engagement and outreach activities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
JavaJavaScriptHTMLElasticMongoLinuxWindowsAWSAzureDocker
Soft Skills
problem solvinginnovationadvisingcollaborationtroubleshooting