Catio

Senior SRE

Catio

full-time

Posted on:

Location Type: Remote

Location: Remote • 🇺🇸 United States

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

AWSCloudGrafanaKubernetesPrometheusSplunkTerraform

About the role

  • Establish the foundations of AWS-based cloud operations and infrastructure-as-code strategy.
  • Design, implement, and administer secure, scalable, and cost-effective AWS infrastructure.
  • Develop infrastructure-as-code using tools like Terraform or Helm to manage and evolve cloud environments.
  • Define and deploy observability pipelines and dashboards across metrics, logs, and traces (CloudWatch, Prometheus, Grafana, etc) with Splunk being our preference and the tool of choice.
  • Write internal documentation and structured reports on architectural decisions and infrastructure health.
  • Collaborate with the product and engineering teams to align infrastructure capabilities with evolving product needs.
  • Operate independently and propose scalable, secure, and production-ready solutions with minimal guidance.

Requirements

  • 5+ years of experience in SRE, DevOps, or Cloud Infrastructure roles with a strong AWS and Kubernetes focus.
  • Advanced expertise in cloud architecture design and administration of core AWS services (VPC, IAM, ECS/EKS, RDS, CloudWatch, etc.).
  • Strong understanding of monitoring, logging and observability frameworks (preferably Splunk) and ability to set up custom dashboards.
  • Ability to analyze current traffic patterns and make technical recommendations for infrastructure choice appropriate for the business context.
  • Proficient in infrastructure-as-code frameworks such as Terraform, Pulumi, or AWS CDK.
  • Proven track record of owning production infrastructure and driving operational excellence at high-growth startups or SaaS companies.
  • Experience setting up and managing CI/CD pipelines and security best practices in cloud environments.
  • Excellent communication skills with the ability to distill complex infrastructure topics into clear written reports and dashboards.
  • Self-starter mindset and thrives in fast-paced, early-stage environments with limited structure.
Benefits
  • top-tier compensation for startups
  • significant equity in a rapidly growing, VC-backed company
  • commitment to fostering an inclusive and diverse workplace

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
AWSKubernetesTerraformHelmCloudWatchPrometheusGrafanaSplunkCI/CDinfrastructure-as-code
Soft skills
communicationcollaborationself-starteranalyticalproblem-solvingoperational excellencedocumentationindependenceadaptabilitytechnical recommendations
Hypergiant

Intermediate DevOps Engineer

Hypergiant
Mid · Seniorfull-time$113k–$136k / year🇺🇸 United States
Posted: 3 hours agoSource: boards.greenhouse.io
AnsibleAWSCloudDockerFluxGoogle Cloud PlatformJavaScriptKubernetesNode.jsReactTerraformTypeScript
The Voleon Group

Senior Site Reliability Engineer

The Voleon Group
Seniorfull-time$205k–$235k / yearCalifornia · 🇺🇸 United States
Posted: 3 hours agoSource: jobs.lever.co
AnsibleAWSCloudGoogle Cloud PlatformGrafanaPrometheusPythonRubyTerraform
Domyn

Senior DevOps Engineer

Domyn
Seniorfull-time🇺🇸 United States
Posted: 5 hours agoSource: apply.workable.com
AWSAzureCloudDockerGoogle Cloud PlatformJavaJavaScriptKubernetesLinuxPostgresPythonTerraform
Acquisition.com

Senior DevOps Engineer

Acquisition.com
Seniorfull-time$171k–$209k / yearArizona, California, Florida, Maryland, Minnesota, Missouri, Nevada, Ohio, Oregon, Pennsylvania, Tennessee, Texas, Utah, Wisconsin · 🇺🇸 United States
Posted: 17 hours agoSource: jobs.ashbyhq.com
AWSCloudDockerGoKubernetesPrometheusPythonTerraformTypeScript