Salary
💰 $180,000 - $200,000 per year
Tech Stack
CloudDockerGoGoogle Cloud PlatformKubernetesMongoDBOpen SourcePostgresPythonRustShell ScriptingTerraformTypeScript
About the role
- Be part of the Cloud Platform SRE Team, focused on building our Cloud Platform using modern tools and best practices
- Champion SRE best practices within the team and throughout the organization
- Solve the whole problem: architecture for resiliency, identify risks, and make it happen
- Use a wide variety of open source technologies and tools
- Facilitate CI/CD practices in collaboration with Development, QA and Technical Operations teams
- Automate build, test, release/deploy, monitoring, and reporting; implement Infrastructure as Code
- Collaborate across functional borders and influence architectural decisions with focus on security, scalability, and high performance
- Understand backup/recovery systems, development automation, storage area networks and virtualization
- Be on an on-call rotation to respond to incidents and support internal or customer-facing incidents
- Develop understanding of near, mid, and long-term business needs and how Platform SRE contributes to success
- Mentor team members and provide leadership and guidance to multiply team output
Requirements
- 8+ years of professional DevOps / SRE experience
- 5+ years of coding (Python, Golang, TypeScript, Shell Scripting, Rust, etc.)
- Experience with Typescript is a plus
- Expert knowledge of Docker and Kubernetes
- Experience with Google Cloud Platform (or similar provider)
- Experience with Build and Deployment tools such as Terraform, CircleCI, ArgoCD
- Experience with multi-region support is a plus
- Experience with ELK Stack, MongoDB, Postgres
- Thorough understanding of backup/recovery systems, development automation routines, storage area networks and virtualization
- Understanding of Continuous Integration & Continuous Delivery practices
- Ability and enthusiasm for automating build, test, release/deploy, monitoring, reporting, including Infrastructure as Code
- Troubleshooting and analytical skills; ability to dive deep and learn a new product
- Strong technical aptitude and exceptional communication skills (written and verbal)
- Ability to appropriately prioritize and respond to different escalations
- Experience and desire to work cross-functionally; team and goal-oriented
- Willingness to be on an on-call rotation to respond to incidents
- High output; low ego