
Site Reliability Engineer – Cloud Infrastructure
gridX
full-time
Location Type: Remote
Location: Remote • 🇩🇪 Germany
Job Level
Junior
Tech Stack
Cloud, Distributed Systems, Kubernetes
About the role
- Actively evolve our multi-tenant cloud and container infrastructure
- Take end-to-end ownership of various components, ensuring they are secure, scalable, observable, and cost-efficient
- Bring a developer's mindset to operations, solving complexity by writing high-quality code and automation
- Mature our observability platform, providing the insights teams need to drive architectural decisions, improve performance, and establish meaningful SLOs
- Proactively identify bottlenecks before they become incidents and lead the resolution when things break
- Build self-service capabilities that allow engineering teams to own their full lifecycle
- Drive the adoption of best practices through code or architecture reviews and technical deep-dives
- Share expertise through high-quality documentation and operational runbooks
Requirements
- Solid experience in an SRE or Platform role, building and managing distributed systems in production environments
- Comfortable working with a high degree of autonomy, navigating ambiguity and driving technical initiatives end-to-end
- Strong hands-on experience with a major public cloud provider
- Understanding of the architectural foundations of cloud infrastructure (Compute, Storage, Networking, and IAM) and fluency in managing them as code
- Pragmatic software engineering approach to operations
- Write clean, maintainable code and scripts, prioritizing long-term stability and quality
- Operational experience with Kubernetes at scale, understanding how to manage upgrades, security and resource allocation in a production cluster
- Embody a "Reliability First" mindset, understanding incident lifecycle management and the importance of psychological safety in engineering
Benefits
- Flexible & mobile working: Work remotely for up to 70 days from anywhere in the EU and other selected countries such as Indonesia, Canada, Brazil and many more
- Vacation: 30 days for your relaxation
- Sports: 30 Euro allowance for Urban Sports Club or E-Gym
- Health: Make use of our (mental) health management offers such as Nilo.health (e.g. 1:1 coaching sessions, daily meditation offers, self-reflection options) for your mental well-being
- Personal development: Annual development budget of 1,500 euros per employee
- Employee discounts: Access to gridX Corporate Benefits
- Stay fit and save the planet with our JobRad offer
- Set up a pension plan and receive a fair monthly contribution
- City travel subsidy: 30 Euros monthly allowance for your monthly/annual ticket
- Modern workplace in the heart of Aachen and Munich with IT equipment of your choice (Apple or Lenovo)
- Annual Teamweek: Enjoy an unforgettable off-site, face extraordinary challenges together with all gridX teams and create lasting memories!
- Experience the gridX culture at regular team events and receive 100 Euros on top per employee for your department event
- We will donate 20 Euros to a charity of your choice on your birthday
- Sabbatical option: Take a break from the daily work routine and realize personal projects, travel or further education (depends on length of employment)
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
cloud infrastructure, distributed systems, Kubernetes, automation, observability, software engineering, incident lifecycle management, resource allocation, security management, scalability
Soft skills
autonomy, problem-solving, technical initiative driving, documentation, communication, collaboration, adaptability, leadership, critical thinking, psychological safety