Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Flip

Site Reliability Engineer

Flip

Site Reliability Engineer responsible for maintaining a highly available cloud infrastructure. Supporting Flip's rapid growth while ensuring reliability and performance of systems.

Posted 4/23/2026full-timeRemote • 🇩🇪 GermanyJuniorWebsite

Tech Stack

Tools & technologies
AnsibleAWSAzureChefCloudGoGoogle Cloud PlatformGrafanaKotlinKubernetesPrometheusPythonTerraform

About the role

Key responsibilities & impact
  • Further expand and optimize our cloud infrastructure on Azure and our Kubernetes clusters - designed for high throughput and highest availability - to support Flip's rapid growth across the globe.
  • Design and implement zero-downtime deployments, rollback mechanisms and disaster-recovery strategies that keep our platform available around the clock.
  • Evolve our LGTM stack (Loki, Grafana, Tempo, Mimir) to give every team the visibility they need - and use it to define and optimize our SLOs.
  • Design, develop and optimize infrastructure as code with Pulumi in Go, eliminating toil and making our platform self-service for engineering teams.
  • Promote CI/CD best practices, incident management, post-mortems and developer experience across the entire engineering organization.
  • Collaborate with your squad and engineering leadership to define the platform's direction - from scalable, high-throughput systems and cost optimization to security posture and compliance.

Requirements

What you’ll need
  • You have 1–3 years of hands-on experience as a Site Reliability Engineer (SRE), Platform Engineer, DevOps Engineer, Infrastructure Engineer, Cloud Engineer, or Backend Engineer with a strong infrastructure focus.
  • Experience operating and scaling cloud infrastructures (Azure, GCP, AWS).
  • Deep knowledge of Kubernetes and container orchestration in production environments.
  • Hands-on experience with modern observability stacks (e.g. Prometheus, Mimir, Loki, ELK) and comfortable defining and operating SLOs and error budgets.
  • Solid software development skills in Go (preferred, since our IaC runs on Pulumi in Go), Python or Kotlin.
  • Hands-on experience with infrastructure as code (e.g. Pulumi, OpenTofu, Terraform) and configuration tooling (e.g. Ansible, Chef).
  • A collaborative mindset, strong communication skills and business-fluent English.
  • Willingness to participate in on-call rotations to ensure the reliability of our platform.

Benefits

Comp & perks
  • Work mode: We’re remote-first, giving you flexibility to work from home. At the same time, we deeply value the power of in-person collaboration. Depending on the role, you’ll join occasional team events, workshops, or meetings in our Berlin or Stuttgart offices - always with plenty of notice. The exact balance will be discussed during your interview.
  • Work-Life-Balance: We don't want you to grow roots to your desk chair. That's why we cover the costs of your E-Gym-Wellpass membership and offer job bike leasing.
  • Celebrating success: Expect highly motivated and committed people in a relaxed working atmosphere.
  • Be part of something bigger: You actively shape Flip in your role. Along the way, you are an enabler of the rapid growth process of a young tech company and grow towards your goals, fun is guaranteed.
  • Happy to be a Flipster: Stay tuned for regular team events and culture days that bring us together as Flipsters.
  • Working abroad: At Flip you can also work abroad in the European Union. Let's talk about remote work in the interview.

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
cloud infrastructureKuberneteszero-downtime deploymentsdisaster-recovery strategiesinfrastructure as codePulumiGoobservability stacksSLOserror budgets
Soft Skills
collaborative mindsetstrong communication skillsbusiness-fluent English