Tech Stack
AWS, Azure, Cloud, Grafana, Jenkins, NoSQL, SQL, Terraform
About the role
- Work as an SRE engineering lead to identify, measure, and prioritize the remediation and modernization of the highest-toil and highest-risk technical debt
- Research products for capacity concerns and resolve capacity issues
- Set up a load test environment and continue building out the monitoring system in OpenSearch, Grafana, and Jenkins
- Move logging out of SQL Server where it still exists
- Migrate systems to cloud-native platforms (AWS, Azure) and implement containerization
- Review and recommend architecture, implementation, and remediation with Agile teams; potentially serve as System Team Architect within ATL architectural practice
- Define and monitor SLIs, SLOs, and SLAs; develop and manage error budgets
- Reduce toil and improve system resilience through automation and efficiency improvements
- Partner with internal teams, external customers, and third-party vendors to design and deliver reliable technical solutions
Requirements
- BS in Computer Science, Computer Engineering, Software Engineering, or an equivalent field of study, or equivalent job experience
- 8+ years' experience developing and delivering software assets utilizing web application frameworks, object-oriented languages, functional programming techniques, design patterns, integration strategies, and secure coding practices
- 8+ years' experience developing scalable web applications and web services (preferably REST); understanding of asynchronous services, event-driven architecture, encapsulation, and architectural partitioning
- Expertise with DevOps practices such as CI/CD and mature test coverage practices
- Ability to interpret code complexity and comprehensibility scans and refactor code for simplicity and readability
- Expert understanding of cloud-based platforms such as AWS, Google Cloud, or Azure and their respective service offerings (serverless, storage, data)
- Expert understanding of security concepts such as OAuth, SAML, and the OWASP Top 10
- 8+ years' practical experience with relational and/or NoSQL databases; core concepts such as ACID transactions, data retrieval, search, and persistence paradigms
- Application performance tuning experience preferred
- Sound troubleshooting skills and techniques
- Strong attention to detail
- Ability to communicate with technical and non-technical personnel, both in writing and verbally
- Experience developing in a team environment
- Experience with Infrastructure as Code (IaC) practices; familiarity with Terraform (preferred), CloudFormation, or similar