66degrees

Site Reliability Engineer

full-time

Location Type: Remote

Location: Remote • 🇺🇸 United States

Job Level

Mid-Level, Senior

Tech Stack

Cloud, Kubernetes, Linux, Python, SDLC, SQL, Terraform

About the role

  • Ensure near-zero downtime through monitoring and alerting, self-healing automation, and continuous improvement
  • Create highly automated, available and scalable systems by applying software and infrastructure principles
  • Employ and advise clients on DevOps and SRE principles and practices, covering deployment pipelines, HA, service reliability, technical debt, and operational toil for live services running at scale
  • Take a proactive approach to our clients’ workloads: anticipate failures, automate tasks, ensure availability, and provide a great customer experience
  • Work closely with clients, your team, and Google engineers to investigate and resolve infrastructure issues
  • Manage a Jira queue of inbound requests for numerous clients while effectively balancing and prioritizing projects
  • Contribute to ad-hoc initiatives such as writing documentation, open-sourcing, and improving operations, making a significant impact at a rapidly growing Google Premier Partner

Requirements

  • 4+ years of cloud and infrastructure experience, including demonstrated expertise with Linux, Windows, Kubernetes, databases, and networking services
  • 2+ solid years of full-time Google Cloud experience preferred
  • Proficiency with Python required. Other programming language experience is a plus
  • Strong provisioning and configuration skills using Terraform
  • Experience in troubleshooting that spans systems, network, and code
  • Microsoft Server and SQL Server experience is a plus but not required
  • Experience with 24x7x365 monitoring, incident response, and on-call support preferred
  • Experience determining and negotiating error budgets, SLIs, SLOs, and SLAs with product owners
  • Demonstrated ability to work independently and as a member of a larger team, including cross-team activities
  • Experience working with Agile methodologies (Scrum, Kanban) across the SDLC
  • Proven experience balancing service reliability, metrics, sustainability, technical debt, and operational toil for live services running at scale
  • Strong communication skills, as this is a heavily customer-facing role
  • A Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or equivalent work experience, is required

Benefits

  • None specified