Customer Site Reliability Engineer – OpenShift Managed Cloud Services, Spoken Japanese, Kubernetes/AWS/Azure, Linux
Red Hat
Customer Site Reliability Engineer managing large-scale systems for cloud services at Red Hat. Focused on enhancing service reliability, customer satisfaction, and technical escalation management.
Tech Stack
Tools & technologies
Ansible, AWS, Azure, Cloud, Distributed Systems, Go, Google Cloud Platform, Kubernetes, Linux, OpenShift, Prometheus, TCP/IP, Terraform
About the role
Key responsibilities & impact
- Manage large-scale, distributed systems, focusing on minimizing downtime and improving system resilience.
- Maintain customer trust and confidence by ensuring stability and functionality of services.
- Drive continuous enhancement of processes, tools, and methodologies to support the evolving needs of the service.
- Lead the development of code and automation scripts to optimize the scalability, reliability, and performance of services.
- Lead and participate in high-priority customer escalations, adopting a customer-first mindset.
- Coordinate and execute complex incident response procedures, ensuring timely resolution and thorough postmortems.
- Collaborate with cross-functional teams to enhance system robustness.
- Demonstrate a proactive mindset to help preempt escalations and ensure reliable operations.
- Document resolutions, root causes, and best practices to enrich the knowledge base and promote self-service solutions.
- Mentor and coach team members, fostering a culture of continuous learning, knowledge sharing and collaboration.
- Participate in on-call rotation and provide leadership during critical incidents.
- Collaborate on strategic AI and automation projects designed to increase the efficiency of fleet operations and troubleshooting, ultimately delivering a better product experience for customers.
Requirements
What you’ll need
- Advanced experience with OpenShift/Kubernetes container platform support or administration.
- Proficient with container-based technologies on Linux.
- Proficient in managing Linux-based systems in a public cloud such as AWS, Azure, or GCP.
- Advanced experience with enterprise systems monitoring; knowledge of Prometheus is preferred.
- Advanced experience with enterprise configuration management tools such as Ansible and Terraform.
- Software engineering experience using object-oriented languages; Go (Golang) is preferred.
- Superior communication skills and experience working directly with and presenting to customers.
- Ability to quickly learn new technologies and follow industry trends.
- Demonstrated ability to quickly and accurately troubleshoot systems issues.
- Solid understanding of standard TCP/IP networking and common protocols.
- Fluent in English; any additional language, such as Japanese, Chinese, Korean, or Spanish, is an advantage.
Benefits
Comp & perks
- Flexible working hours
- Professional development opportunities
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
OpenShift, Kubernetes, Linux, AWS, Azure, GCP, Prometheus, Ansible, Terraform, Object-Oriented Programming
Soft Skills
Superior Communication Skills, Proactive Mindset, Mentoring, Collaboration, Customer-First Mindset