
Staff Platform Site Reliability Specialist, Observability – Kubernetes
Everbridge
full-time
Posted on:
Location Type: Remote
Location: Canada
Visit company websiteExplore more
Salary
💰 CA$135,000 - CA$165,000 per year
Job Level
Tech Stack
About the role
- Head the design, operation, and evolution of Everbridge’s observability stack
- Build and maintain a highly available, scalable observability platform
- Standardize instrumentation, dashboards, alerts, and SLOs
- Support incident response, root cause analysis, and capacity planning
- Operate and scale Grafana and technology
- Maintain reliability and security of EKS clusters running observability
- Manage cluster lifecycle and upgrades
- Terraform for infrastructure provisioning
- Gitlab CI/CD at Scale
Requirements
- 6+ years in SRE / Platform Engineering
- Strong Grafana ecosystem experience
- Kubernetes and Amazon EKS expertise
- Terraform proficiency
Benefits
- healthcare
- dental care
- mental health benefits
- disability income benefits
- life and AD&D insurance
- retirement savings plan with employer match
- paid time off
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
observabilityincident responseroot cause analysiscapacity planningGrafanaKubernetesAmazon EKSTerraformGitlab CI/CD