
Senior Site Reliability Engineer
Satsuma Technology Ltd
full-time
Posted on:
Location Type: Remote
Location: Texas • United States
Visit company websiteExplore more
Job Level
About the role
- Own infrastructure across AWS, GCP, and Azure environments
- Build and maintain CI/CD pipelines, observability stacks, and incident response workflows
- Define and enforce SLOs/SLIs; lead postmortems
- Author and maintain IaC (Terraform preferred)
- Write internal tooling and automation using AI-assisted development workflows
- Partner closely with engineering on reliability reviews and architecture decisions
Requirements
- 5-8 years in SRE, DevOps, or infrastructure engineering
- Hands-on experience across at least two major cloud providers
- Strong Kubernetes, Terraform, and observability tooling (Datadog, Grafana, or equivalent)
- Comfortable reading and editing code; able to ship scripts and internal tools
- Experience with AI-assisted development (Copilot, Cursor, Claude Code)
- On-call maturity -- you've owned incidents end-to-end and made systems better afterward
- Prior experience at a startup or high-growth SaaS company
- Familiarity with API gateway infrastructure or commerce tech stacks
- Hands-on experience with MCP or agentic AI infrastructure
Benefits
- Unlimited PTO
- 401(K)
- Healthcare Stipend
- Gym stipend
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSGCPAzureCI/CDTerraformKubernetesobservabilityAI-assisted developmentincident responseAPI gateway
Soft Skills
leadershipcollaborationproblem-solvingcommunicationon-call maturity