Cognitiv

Senior Site Reliability Engineer

Cognitiv

full-time

Posted on:

Origin:  • 🇺🇸 United States • Washington

Visit company website
AI Apply
Apply

Salary

💰 $160,000 - $210,000 per year

Job Level

Senior

Tech Stack

AnsibleAWSCloudEC2KubernetesPrometheusPythonTerraform

About the role

  • Design, implement, and maintain infrastructure across datacenters and hybrid cloud deployments
  • Assess physical and network architectures for scalability, reliability, and cost efficiency
  • Improve monitoring, incident management, and disaster recovery (including migration to Datadog)
  • Implement infrastructure as code with Terraform and Ansible, automate using Python and Bash
  • Monitor and maintain shared infrastructure in AWS ensuring availability and stability
  • Work closely with engineering and product teams to scope projects to core business requirements
  • Lead major service management initiatives and drive long-term roadmap

Requirements

  • 7+ years in operations, engineering, or SRE with expertise in multi-datacenter deployments
  • Deep knowledge of AWS infrastructure, networking, and service management practices
  • Skilled in infrastructure as code and automation, with proficiency in Python and Bash
  • Experience with Terraform and Ansible
  • Some Kubernetes experience
  • Willingness to travel 1–2 times per quarter for deployments (bonus)
  • Self-starter with strong communication and collaboration skills