Red Hat

Senior Site Reliability Engineer – OpenShift/Kubernetes, Golang, Linux

full-time

Location Type: Hybrid

Location: Bangalore • 🇮🇳 India

Job Level

Senior

Tech Stack

Ansible, AWS, Azure, Chef, Cloud, Distributed Systems, DNS, Docker, Go, Google Cloud Platform, Java, Kubernetes, Linux, OpenShift, Prometheus, Puppet, Python, TCP/IP, Unix

About the role

  • Develop, scale, and operate OpenShift managed cloud services
  • Contribute code to increase the scalability and reliability of the service
  • Help develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
  • Participate in a regular on-call schedule, including occasional paid weekends and holidays
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from the Red Hat Global Support team
  • Work within a small agile team to develop and improve SRE software, support peers, plan, and self-improve
  • Proactively utilize AI-assisted development tools for code generation, auto-completion, and intelligent suggestions
  • Participate in AI-assisted code reviews, utilizing tools that provide real-time feedback
  • Collaborate with cross-functional teams to identify opportunities for AI integration within the software development lifecycle

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or related field; equivalent practical experience will also be considered.
  • 5+ years of experience in at least one programming language (Python, Golang, Java)
  • Hands-on experience with public cloud platforms (AWS, GCP, Azure), preferably Azure
  • 4+ years of experience with Kubernetes or OpenShift
  • Experience with Docker-based containers
  • Strong collaboration and problem-solving skills in distributed, team-based environments.
  • Experience troubleshooting as-a-service offerings (SaaS/PaaS) and working with complex distributed systems.
  • Working knowledge of Linux/Unix operating systems.
  • Proven ability to automate repetitive tasks and debug performance issues.
  • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure
  • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus
  • 3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef
  • 2+ years of experience delivering a hosted service
  • Demonstrated ability to quickly and accurately troubleshoot system issues
  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
  • Solid communication skills and experience working directly with and presenting to customers

Benefits

  • Health insurance
  • Professional development opportunities
  • Flexible working arrangements

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Python, Golang, Java, Kubernetes, OpenShift, Docker, Linux, Red Hat Enterprise Linux, Ansible, Prometheus
Soft skills
collaboration, problem-solving, mentoring, knowledge sharing, communication, troubleshooting, self-improvement, agile teamwork, customer interaction, incident response