
Senior Site Reliability Engineer
CoderPad
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Salary
💰 $170,000 - $180,000 per year
Job Level
Tech Stack
About the role
- Design, operate, and evolve production infrastructure across AWS, GCP, Heroku, and Kubernetes.
- Own and improve monitoring, alerting, and SLOs for customer-facing services.
- Lead and participate in incident response, postmortems, and long-term remediation.
- Build and maintain infrastructure-as-code, CI/CD pipelines, and automation (Terraform, GitLab CI, Kubernetes tooling).
- Drive scalability, performance, and resilience across a real-time SaaS platform.
- Ensure security, patching, and operational hygiene across all environments.
- Partner with product and engineering teams to enable safe, fast, and reliable releases.
- Actively contribute to cost visibility and cloud optimization.
Requirements
- 5+ years of experience in SRE, DevOps, Platform Engineering, or Cloud Infrastructure roles.
- Strong experience with AWS and GCP, including networking, IAM, compute, and managed services.
- Hands-on experience running Kubernetes in production.
- Strong knowledge of Terraform or equivalent infrastructure-as-code tooling.
- Experience with CI/CD systems (e.g., GitLab CI or similar).
- Solid understanding of observability (metrics, logs, traces) using tools like Datadog, Prometheus, Grafana, or similar.
- Proficiency in at least one programming or scripting language (Go, Python, Node.js, or similar).
- Strong Linux and Bash skills.
- Experience operating high-availability, customer-facing SaaS platforms.
Benefits
- Meaningful work with high impact for a well-loved product
- Competitive, market-rate salaries
- Stock options with a 4-year vesting schedule
- Medical, dental, and vision insurance (90% covered for employees and dependents)
- Flexible Spending Account (FSA)
- 401K with profit sharing
- Unlimited paid time off with an expectation of taking 3 weeks annually in addition to 20 company holidays
- Remote-friendly environment with monthly WFH stipend
- Parental leave (primary: 16 weeks; secondary: 12 weeks)
- Short- and long-term disability and life insurance coverage
- Choice of laptop computer
- Internal mobility and growth opportunities
- And more…
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSGCPKubernetesTerraformGitLab CIobservabilityLinuxBashGoPython
Soft Skills
leadershipincident responsecommunicationcollaborationproblem-solving