Cognitiv

Senior Site Reliability Engineer

Cognitiv

full-time

Posted on:

Location Type: Hybrid

Location: Bellevue • Washington • 🇺🇸 United States

Visit company website
AI Apply
Apply

Salary

💰 $160,000 - $210,000 per year

Job Level

Senior

Tech Stack

AnsibleAWSCloudEC2KubernetesPrometheusPythonTerraform

About the role

  • Expand global footprint of datacenters and improve service management across Cognitiv
  • Rapidly expand hybrid cloud infrastructure and design, implement, maintain infrastructure across datacenters and hybrid cloud deployments
  • Assess physical and network architectures for scalability, reliability, and cost efficiency
  • Improve monitoring, incident management, and disaster recovery, including migration to Datadog
  • Implement and own Infrastructure as Code and automation using Terraform, Ansible, Python, and Bash
  • Monitor and maintain shared infrastructure in AWS ensuring availability and stability
  • Collaborate with engineering and product teams to tightly scope projects to core business requirements
  • Lead major service management initiatives and drive long-term engineering practice improvements

Requirements

  • 7+ years in operations, engineering, or SRE with multi-datacenter experience
  • Deep knowledge of AWS infrastructure and networking
  • Expertise in service management practices
  • Skilled in Infrastructure as Code (Terraform) and automation (Ansible)
  • Proficiency in Python and Bash scripting
  • Experience with Datadog and Prometheus monitoring tools (Datadog migration)
  • Experience with Kubernetes, EC2, and bare metal deployments
  • Self-starter with strong ownership and problem-solving abilities
  • Strong communication and collaboration skills across functions
  • Bonus: hybrid cloud/on-prem solutions experience
  • Bonus: hands-on datacenter buildout experience
  • Bonus: willingness to travel 1–2 times per quarter for deployments
Benefits
  • Medical, dental & vision coverage (some plans 100% employer-paid)
  • 12 weeks paid parental leave
  • Unlimited PTO
  • Work-From-Anywhere August
  • Career development with clear advancement paths
  • Equity for all employees
  • Hybrid work model (3 days in-office, 2 days remote)
  • Daily team lunch
  • Health & wellness stipend
  • Cell phone reimbursement
  • 401(k) with employer match
  • Parking (CA & WA offices)
  • Pre-tax commuter benefits
  • Employee Assistance Program
  • Comprehensive onboarding (Cognitiv University)
  • Cross-team games, events, and creative team bonding

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Infrastructure as CodeTerraformAnsiblePythonBashAWSKubernetesEC2DatadogPrometheus
Soft skills
problem-solvingownershipcommunicationcollaboration