Salary
💰 $141,000 - $216,000 per year
Tech Stack
AWS, Cloud, DynamoDB, Go, IoT, Kubernetes, Python, Ruby, Terraform
About the role
- As a Staff Site Reliability Engineer on the Platform team, design, scale, and manage AWS-backed services for millions of connected IoT devices and mobile and SaaS users.
- Ensure services and new products/features operate smoothly and meet Motive’s 99.99% availability SLOs.
- Collaborate with engineering and product teams to design and build the infrastructure and services required to deliver new features in a cloud-native and event-driven fashion.
- Use and advance IaC (Terraform) and CM (Helm) code and strategies for advanced scaling and self-service usage by engineering teams.
- Identify and remove bottlenecks from production systems across AWS services and the Kubernetes platform.
- Continuously improve monitoring and alerting capabilities to enable proactive operations.
- Perform advanced troubleshooting with teams.
Requirements
- 5+ years of professional SRE/DevOps experience and a demonstrated ability to work on high-volume production systems.
- Demonstrable systems architecture expertise, solving complex technical problems and implementing company-wide solutions.
- Advanced knowledge of AWS services and technologies (ALB/ELB, IAM permissions, DynamoDB, SNS, EKS/Fargate, etc.).
- Experience using infrastructure as code and configuration management (especially Terraform and Helm charts) to design and provision new services.
- Knowledge of Python, Bash or other scripting languages. Knowledge of Ruby or Golang is a plus.
- High level of ownership and drive to work with others and see improvements through to production.
- The applicant must be authorized to receive and access those commodities and technologies controlled under U.S. Export Administration Regulations.