Red Hat

Senior Site Reliability Engineer

Red Hat

full-time

Posted on:

Location Type: Remote

Location: North CarolinaUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $111,260 - $183,580 per year

Job Level

About the role

  • Contribute code to increase the scalability and reliability of the service
  • Contribute software tests and participate in peer review to increase the quality of our codebase
  • Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
  • Participate in a regular on-call schedule, including occasional paid weekends and holidays
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from the Red Hat Global Support team
  • Work within a small agile team to develop and improve SRE software, support your peers, plan and self-improve
  • Collaborate with cross-functional teams to identify opportunities for AI integration within the software development lifecycle

Requirements

  • A bachelor's degree in Computer Science or a related technical field involving software or systems engineering is required
  • Experience programming in at least one of these languages: Python, Golang, Java, C, C++ or another object-oriented language
  • Experience working with public clouds such as AWS, GCP, or Azure
  • Ability to collaboratively troubleshoot and solve problems in a team setting
  • Experience troubleshooting an as-a-service offering (SaaS, PaaS, etc.)
  • Experience working with complex distributed systems
  • Direct experience with Kubernetes or OpenShift is a plus
  • A demonstrated ability to debug, optimize code and automate routine tasks
  • A basic understanding of Unix/Linux operating systems
  • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider
  • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus
  • 3+ years of experience with enterprise configuration management software like Ansible, Puppet, or Chef
  • 2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred
  • 2+ years of experience delivering a hosted service
  • Demonstrated ability to quickly and accurately troubleshoot system issues
  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
  • Solid communications skills and experience working directly with and presenting to customers
  • 1+ year(s) of experience with Kubernetes is a plus
  • 1+ year(s) of experience with docker-based containers is a plus
Benefits
  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, and employee assistance program

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
PythonGolangJavaCC++KubernetesOpenShiftUnix/LinuxRed Hat Enterprise LinuxPrometheus
Soft skills
collaborative troubleshootingproblem solvingmentoringknowledge sharingcommunicationteamworkself-improvementcustomer interactionpeer reviewincident response