Salary
💰 $124,364 - $185,545 per year
Tech Stack
AWSElasticSearchGoGoogle Cloud PlatformGrafanaPrometheusPythonRuby
About the role
- Responsible for the reliability of our SaaS product and infrastructure supporting production quantum computers
- Refine and evolve monitoring systems covering AWS, GCP, on-premises, and remote systems
- Work with software and hardware engineering to elicit requirements and provide dashboards
- Own the alerting with SREs to support infrastructure and on-call management systems
- Collaborate with DevOps and Test Engineering teams to ensure reliability throughout the software development lifecycle
Requirements
- 4+ years of experience operating and troubleshooting SaaS/PaaS applications on AWS/GCP
- Experience with high level SRE work including incident management and process design
- 4+ years of experience with AOS/Elasticsearch/Loki or similar
- Experience with time series databases like Prometheus/InfluxDB
- Proficiency in InfluxQL and PromQL
- Significant expertise with analytics and monitoring systems such as ELK, Grafana, etc.
- At least two years of programming experience in Python, Go, Bash, Ruby, or equivalent
- Degree in Computing Science, Engineering or equivalent education and experience
- Company ownership
- Competitive pay
- Meaningful benefits
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
SaaSPaaSAWSGCPSREincident managementAOSElasticsearchLokitime series databases
Soft skills
collaborationcommunicationproblem-solvingrequirements elicitationprocess design