
Site Reliability Engineer
Learning Technologies Group plc
full-time
Posted on:
Location Type: Hybrid
Location: Franklin • Tennessee • United States
Visit company websiteExplore more
Tech Stack
About the role
- Assist in the improvement of internal automation specifically through the use of “Infrastructure as Code” tools
- Assist in the maintenance of and addition of new features to the infrastructure control plane
- Act as the primary contact to monitor, troubleshoot, and resolve production issues as part of an on-call rotation of roughly one week per month to adhere to a 24/7/365 SLA
- Collaborate with the Director of DevSecOps as well as other members of the SRE team to explore, plan, and implement or improve the security posture, reliability, performance, and cost of hosted resources
- Collaborate with one or more product development teams on application development direction as relates to aspects of deployment and operational factors
- Collaborate with the members of the support and integration teams to assist with the support of customer environments as relates to deployment and operational factors
- Continuously improve knowledge of best practices in site reliability and technical skills related to security, automation, networking, and system operations
Requirements
- Extensive experience of one or more Unix CLIs including tools such as zsh/bash, non-graphical text editor(s), git, and various other common shell utilities for system administration
- Extensive experience using AWS resources in the deployment of highly available web application platforms
- Experience with implementing Infrastructure as Code
- Experience with the application of security tools, controls, and policies
- Experience with CI/CD pipeline configuration and orchestration
- Experience with deployment and orchestration of containerized resources (Docker, Kubernetes, etc.)
- Broad knowledge of AWS resources including EC2, ECS, S3, RDS, CloudFront, Elasticache, SQS, Route53, ELB, Lambda, etc.
- Broad knowledge of DNS, CDNs, load balancers, web servers, application servers, databases (MySQL), and networking concepts like the basics of TCP/UDP, RFC 1918 subnets, and NAT
- Familiarity with Terraform, CloudFormation, Ansible or similar tools
- Familiarity with GitHub Actions, Jenkins, or other automated task management tools
- Familiarity with web application (HTML, CSS) development using contemporary frameworks in Java, Python, JavaScript/TypeScript, or similar particularly in extensible, scalable, performant, and secure implementations
- Familiarity with non-web based scripting language(s) such as Python, JavaScript (Node.js), Go, etc.
Benefits
- Flexible work environment: Rustici Software offers the best of all worlds when it comes to where you work. Remote from your home office, a private office in Franklin, TN if you prefer, or a mix of both.
- Untracked PTO
- Medical, Dental, and Vision insurance
- HSA and FSA plans
- Short-term and Long-term disability
- Company paid life insurance
- 401k/Retirement vesting+matching on day 1
- Performance-based bonuses
- Office perks: Concierge services, gym equipment, yoga room and stocked kitchen with snacks and drinks
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Infrastructure as CodeUnix CLIAWSCI/CDDockerKubernetesTerraformCloudFormationAnsiblePython
Soft Skills
collaborationtroubleshootingmonitoringproblem-solvingcommunicationcustomer supportcontinuous improvement