
Senior Site Reliability Engineer
SysEleven GmbH
full-time
Posted on:
Location Type: Remote
Location: Germany
Visit company websiteExplore more
Job Level
About the role
- Ensure the reliability, availability, and performance of our Database- and Observability-as-a-Service products
- Manage container-based applications in Kubernetes with a strong focus on security and resilience
- Lead incident response, root cause analysis, and sustainable remediation efforts
- Apply GitOps principles using Helm and Argo CD
- Develop API services and tooling in Go to deliver stable SaaS products
- Build and optimize CI/CD pipelines to improve deployment safety and system stability
- Design and manage scalable infrastructure using IaC tools (e.g., Terraform) in cloud environments
Requirements
- Several years of experience operating highly available systems in Linux and Kubernetes environments
- Strong understanding of observability concepts (monitoring, logging, tracing)
- Practical development experience in Go (knowledge of Python or Rust is a plus)
- Experience with Infrastructure-as-Code tools such as Terraform or OpenTofu
- Hands-on experience in incident management and structured root cause analysis
- Familiarity with CI systems, especially GitLab CI
- Strong problem-solving skills and good communication skills in German and English (minimum B2 level)
Benefits
- Blameless culture
- Open communication
- Knowledge sharing
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
KubernetesGoTerraformGitOpsCI/CDAPI developmentLinuxobservabilityincident managementInfrastructure-as-Code
Soft Skills
problem-solvingcommunication