
Senior Site Reliability Engineer
Clarifai
full-time
Posted on:
Location Type: Remote
Location: Canada
Visit company websiteExplore more
Job Level
About the role
- Ensure the smooth operation and high availability of Clarifai's core services
- Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
- Develop Kubernetes resources and custom tooling for seamless cloud and on-premise deployments
- Design and implement scalable, secure, and cost-effective infrastructure solutions.
- Partner with teams across the organization to identify & solve engineering challenges
Requirements
- BS/BA in Computer Science or related degree
- Good knowledge of cloud providers (AWS, GCP or similar)
- Expertise with Kubernetes (EKS, GKE, self-hosted) and Infrastructure as Code using Terraform, Helm
- Solid understanding of web and networking (HTTP, TLS, DNS, Certificates, etc)
- Experience with CI/CD pipelines using tools such as GitHub Actions, ArgoCD, and Atlantis
- Strong interpersonal skills working with teams across different time zones and regions
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
KubernetesInfrastructure as CodeTerraformHelmCI/CDGitHub ActionsArgoCDAtlantiscloud providersweb and networking
Soft skills
interpersonal skills
Certifications
BS in Computer ScienceBA in Computer Science