Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Red Hat

Customer Site Reliability Engineer – OpenShift Managed Cloud Services, Spoken Japanese, Kubernetes/AWS/Azure, Linux

Red Hat

Customer Site Reliability Engineer managing large-scale systems for cloud services at Red Hat. Focused on enhancing service reliability, customer satisfaction, and technical escalation management.

Posted 6/30/2026full-timeRemote • 🇦🇺 AustraliaMid-LevelSeniorWebsite

Tech Stack

Tools & technologies
AnsibleAWSAzureCloudDistributed SystemsGoGoogle Cloud PlatformKubernetesLinuxOpenShiftPrometheusTCP/IPTerraform

About the role

Key responsibilities & impact
  • Manage large-scale, distributed systems, focusing on minimizing downtime and improving system resilience.
  • Maintain customer trust and confidence by ensuring stability and functionality of services.
  • Drive continuous enhancement of processes, tools, and methodologies to support the evolving needs of the service.
  • Lead the development of code and automation scripts to optimize the scalability, reliability, and performance of services.
  • Lead and participate in high-priority customer escalations, adopting a customer-first mindset.
  • Coordinate and execute complex incident response procedures, ensuring timely resolution and thorough postmortems.
  • Collaborate with cross-functional teams to enhance system robustness.
  • Demonstrate a proactive mindset to help preempt escalations and ensure reliable operations.
  • Document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions.
  • Mentor and coach team members, fostering a culture of continuous learning, knowledge sharing and collaboration.
  • Participate in on-call rotation and provide leadership during critical incidents.
  • Collaborate on strategic AI and automation projects designed to increase the efficiency of fleet operations and troubleshooting, ultimately delivering a better product experience for customers.

Requirements

What you’ll need
  • Advanced Experience with OpenShift/Kubernetes container platform support or administration.
  • Proficient with container-based technologies on Linux.
  • Proficient in managing Linux-based systems in a public cloud such as AWS, Azure, or GCP.
  • Advanced experience with enterprise systems monitoring; knowledge of Prometheus is preferred.
  • Advanced with enterprise configuration management such as Ansible, Terraform.
  • Software engineering experience using object-oriented languages; golang is preferred.
  • Superior communications skills and experience working directly with and presenting to customers.
  • Ability to quickly learn new technologies and follow industry trends.
  • Demonstrated ability to quickly and accurately troubleshoot systems issues.
  • Solid understanding of standard TCP/IP networking and common protocols.
  • Fluent in English and any additional language like Japanese, Chinese, Korean, Spanish is an advantage.

Benefits

Comp & perks
  • Flexible working hours
  • Professional development opportunities

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
OpenShiftKubernetesLinuxAWSAzureGCPPrometheusAnsibleTerraformObject-Oriented Programming
Soft Skills
Superior Communication SkillsProactive MindsetMentoringCollaborationCustomer-First Mindset