
Site Reliability Engineer
Planet
full-time
Posted on:
Location Type: Hybrid
Location: California • United States
Visit company websiteExplore more
Salary
💰 $172,200 - $215,200 per year
About the role
- Build and deploy computing services and infrastructure in customer environments for a next-generation satellite operations and image processing end-to-end platform
- Operate in a high-impact, tight knit team to architect novel systems for air-gapped deployments at scale
- Clarify and surface requirements from ambiguous use cases defined by cross-functional stakeholders, including internal users and external customers
- Responsible for operations such as deployments, service orchestration, and documentation for cross platform stakeholders
- Scale architecture while ensuring availability of services
- Improve reliability and scalability by resolving edge cases, studying failure modes, and writing tests
- Participate in on-call rotations to ensure operational excellence
Requirements
- Bachelor’s degree in Computer Science or similar
- 10+ years of experience building services that leverage cloud-native infrastructure and tooling
- Experience deploying and maintaining bare-metal and cloud kubernetes through tools such as Talos, RKE2, Proxmox, or k3s
- Proficiency with Terraform, Ansible, Helm, Kustomize, and/or similar IaC / GitOps tooling
- Experience successfully building, releasing, and supporting highly available, consistently performant services
- Knowledge of hardware and network level implications of on-prem compute
- Experience with platform optimization, particularly resource optimization, management, and cluster tuning in a constrained environment
- Ability to observe and troubleshoot distributed systems with tools such as Alloy, Prometheus, Grafana, and OpenTelemetry
- Advanced skills in Python, Bash, and other tooling as appropriate to build services and meet product goals
- Excellent communication skills and the ability to work through collaboration with cross-functional engineering teams
- Experience working with Jira for task management and progress tracking.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
cloud-native infrastructureKubernetesTerraformAnsibleHelmKustomizePythonBashservice orchestrationplatform optimization
Soft Skills
communicationcollaborationtroubleshootingproblem-solvingoperational excellence
Certifications
Bachelor’s degree in Computer Science