Tech Stack
KubernetesOpenShift
About the role
- Install and manage OpenShift/Kubernetes clusters and platform components
- Capacity planning and cluster performance management
- Manage and resolve incidents for container platform and support application issues
- Problem management: identify root causes, corrective actions, and tracking
- Change management: implement changes following events or client requests
- Coordinate analysis, classification and implementation with other teams
- Configure virtual networks and network interfaces within the platform
- Install, configure and operate monitoring tools and alerts; integrate with client tools
- Deploy application event and log collection systems for client teams
- Provide 24x7 critical incident support and participate in on-call shifts
- Perform patch management, periodic infrastructure updates and define backups
- Produce periodic reports on availability and performance
- Manage user profiles, keys and cryptographic certificates
Requirements
- Experience in installing and managing complex infrastructures for medium to large clients
- In-depth technical knowledge of involved technologies
- Ability to work in a team
- Ability to lead troubleshooting and focus on problem-solving
- Relational skills both internally and with clients
- Several years of experience in delivery
- Decision-making skills; Leadership abilities
- Technical certifications (Red Hat, Hyperscalers, others)
- OpenShift / Kubernetes cluster setup
- Capacity management and cluster performance management
- Management and resolution of incidents; problem management, root cause analysis
- Change management experience
- Management of basic components of the OpenShift / Kubernetes platform
- Management of internal virtual network configurations and network interfaces
- Experience with monitoring tools, alert management and integrations
- Experience with application event and log collection systems
- Availability for 24x7 critical incident support and on-call shifts
- Patch management and updates for platform components
- Backup procedures definition and real-time monitoring
- User ID/profile management and LDAP integration experience
- Management of keys and cryptographic certificates
- Benefits that give you choice and support you and your family
- Employee learning programs with access to certifications (Microsoft, Google, Amazon, Skillsoft, and many more)
- Company-wide volunteering and giving platform (donate, start fundraisers, volunteer, search over 2 million non-profit organizations)
- Partially remote work (Partially Remote listed in posting)
ATS Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
OpenShiftKubernetescapacity planningcluster performance managementincident managementproblem managementchange managementmonitoring toolspatch managementbackup procedures
Soft skills
teamworktroubleshootingproblem-solvingrelational skillsdecision-makingleadership
Certifications
Red HatHyperscalers