gridX

Site Reliability Engineer – Cloud Infrastructure

full-time

Location Type: Remote

Location: Remote • 🇩🇪 Germany

Job Level

Junior

Tech Stack

Cloud, Distributed Systems, Kubernetes

About the role

  • Actively evolve our multi-tenant cloud and container infrastructure
  • Take end-to-end ownership of various components, ensuring they are secure, scalable, observable, and cost-efficient
  • Bring a developer's mindset to operations, solving complexity by writing high-quality code and automation
  • Mature our observability platform, providing the insights teams need to drive architectural decisions, improve performance, and establish meaningful SLOs
  • Proactively identify bottlenecks before they become incidents and lead the resolution when things break
  • Build self-service capabilities that allow engineering teams to own their full lifecycle
  • Drive the adoption of best practices through code or architecture reviews and technical deep-dives
  • Share expertise through high-quality documentation and operational runbooks

Requirements

  • Solid experience in an SRE or Platform role, building and managing distributed systems in production environments
  • Comfortable working with a high degree of autonomy, navigating ambiguity and driving technical initiatives end-to-end
  • Strong hands-on experience with a major public cloud provider
  • Understanding of the architectural foundations of cloud infrastructure (Compute, Storage, Networking, and IAM) and fluency in managing them as code
  • A pragmatic software engineering approach to operations
  • Write clean, maintainable code and scripts, prioritizing long-term stability and quality
  • Operational experience with Kubernetes at scale, understanding how to manage upgrades, security and resource allocation in a production cluster
  • Embody a "Reliability First" mindset, understanding incident lifecycle management and the importance of psychological safety in engineering

Benefits

  • Flexible & mobile working: Work remotely for up to 70 days from anywhere in the EU and other selected countries such as Indonesia, Canada, Brazil and many more
  • Vacation: 30 days for your relaxation
  • Sports: 30 Euro allowance for Urban Sports Club or E-Gym
  • Health: Make use of our (mental) health management offers such as Nilo.health (e.g. 1:1 coaching sessions, daily meditation offers, self-reflection options) for your mental well-being
  • Personal development: Annual development budget of 1,500 euros per employee
  • Employee discounts: Access to gridX Corporate Benefits
  • Stay fit and save the planet with our JobRad offer
  • Set up a pension plan and receive a fair monthly contribution
  • City travel subsidy: A monthly allowance of 30 Euros for your monthly/annual ticket
  • Modern workplaces in the heart of Aachen and Munich with IT equipment of your choice (Apple or Lenovo)
  • Annual Teamweek: Enjoy an unforgettable off-site, face extraordinary challenges together with all gridX teams and create lasting memories!
  • Experience the gridX culture at regular team events and receive an additional 100 Euros per employee for your department event
  • We will donate 20 Euros to a charity of your choice on your birthday
  • Sabbatical option: Take a break from the daily work routine to pursue personal projects, travel or further education (depends on length of employment)

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
cloud infrastructure, distributed systems, Kubernetes, automation, observability, software engineering, incident lifecycle management, resource allocation, security management, scalability
Soft skills
autonomy, problem-solving, technical initiative driving, documentation, communication, collaboration, adaptability, leadership, critical thinking, psychological safety