
Senior Site Reliability Engineer – OpenShift/Kubernetes, Golang, Linux
Red Hat
full-time
Location Type: Hybrid
Location: Bangalore • 🇮🇳 India
Job Level
Senior
Tech Stack
Ansible, AWS, Azure, Chef, Cloud, Distributed Systems, DNS, Docker, Go, Google Cloud Platform, Java, Kubernetes, Linux, OpenShift, Prometheus, Puppet, Python, TCP/IP, Unix
About the role
- Develop, scale, and operate OpenShift managed cloud services
- Contribute code to increase the scalability and reliability of the service
- Help develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
- Participate in a regular on-call schedule, including occasional paid weekends and holidays
- Practice sustainable incident response and blameless postmortems
- Resolve customer issues escalated from the Red Hat Global Support team
- Work within a small agile team to develop and improve SRE software, support peers, plan work, and self-improve
- Proactively utilize AI-assisted development tools for code generation, auto-completion, and intelligent suggestions
- Participate in AI-assisted code reviews, utilizing tools that provide real-time feedback
- Collaborate with cross-functional teams to identify opportunities for AI integration within the software development lifecycle
Requirements
- Bachelor’s degree in Computer Science, Engineering, or related field; equivalent practical experience will also be considered.
- 5+ years of experience in at least one programming language (Python, Golang, Java)
- Hands-on experience with public cloud platforms (AWS, GCP, Azure), preferably Azure
- 4+ years of experience with Kubernetes or OpenShift
- Experience with Docker-based containers
- Strong collaboration and problem-solving skills in distributed, team-based environments.
- Experience troubleshooting as-a-service offerings (SaaS/PaaS) and working with complex distributed systems.
- Working knowledge of Linux/Unix operating systems.
- Proven ability to automate repetitive tasks and debug performance issues.
- 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure
- 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus
- 3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef
- 2+ years of experience delivering a hosted service
- Demonstrated ability to quickly and accurately troubleshoot system issues
- Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
- Solid communication skills and experience working directly with and presenting to customers
Benefits
- Health insurance
- Professional development opportunities
- Flexible working arrangements
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Python, Golang, Java, Kubernetes, OpenShift, Docker, Linux, Red Hat Enterprise Linux, Ansible, Prometheus
Soft skills
collaboration, problem-solving, mentoring, knowledge sharing, communication, troubleshooting, self-improvement, agile teamwork, customer interaction, incident response