SysEleven GmbH

Senior Site Reliability Engineer – Kubernetes Platform

SysEleven GmbH

full-time

Posted on:

Location Type: Hybrid

Location: BerlinGermany

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Design and implement observability solutions using Prometheus, Loki and Mimir, including defining meaningful alerts
  • Analyze, troubleshoot and further develop custom Kubernetes controllers to ensure reliability and stability
  • Develop and maintain production applications with a focus on code quality, scalability and operational readiness
  • Operate, automate and continuously evolve the MKA platform with a focus on efficiency and maintainability
  • Enhance internal tooling to drive automation and reduce manual effort

Requirements

  • Experience operating highly available, business-critical applications in cloud and on-premises environments, including incident leadership
  • Strong Kubernetes knowledge and experience in cluster management
  • Experience with GitOps principles and ArgoCD for deployment and delivery workflows
  • Experience with Infrastructure as Code, particularly Terraform and Ansible
  • Proficient in Bash and/or Python for automation and tooling
  • Understanding of CI/CD pipelines, ideally with Tekton-based workflows
  • Very good German skills and good English skills (B2+) for technical collaboration
  • Nice to have: experience programming in Go
  • Experience with Nix for development tooling and automation
  • Experience with Helm, Make and Git
  • Additional experience with cloud-native platforms, observability or platform automation
Benefits
  • Deep hands-on Kubernetes experience
  • Freedom to solve challenges
  • Opportunities to share knowledge and continuously learn
  • Collaborative team environment
  • Internal show-and-tell sessions
  • Attendance at conferences such as KubeCon or Container Days
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
KubernetesGitOpsArgoCDInfrastructure as CodeTerraformAnsibleBashPythonCI/CDGo
Soft Skills
incident leadershiptechnical collaboration