Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Finom

Senior Site Reliability Engineer, SRE

Finom

. Lead the Platform Evolution: Design and operate our Kubernetes ecosystem (GKE, multi-cluster) with a focus on high availability and zero-downtime operations.

Posted 5/27/2026full-timeRemote • 🇪🇸 SpainSeniorWebsite

Tech Stack

Tools & technologies
AWSCloudGoogle Cloud PlatformGrafanaKubernetesPrometheusTerraform

About the role

Key responsibilities & impact
  • Lead the Platform Evolution: Design and operate our Kubernetes ecosystem (GKE, multi-cluster) with a focus on high availability and zero-downtime operations.
  • Build "Paved Roads": Own and evolve our PaaS strategy, using GitOps (ArgoCD) and CI/CD (GitLab) to empower domain teams to deploy independently.
  • Architect Reliability: Define and implement our observability strategy across metrics, logs, and tracing (Prometheus, VictoriaMetrics, OpenTelemetry).
  • Drive Infrastructure-as-Code: Lead the automation of our infrastructure using Terraform, ensuring all resources are standardized and version-controlled.
  • Own the Error Budget: Partner with engineering teams to establish and manage SLOs, SLAs, and incident management frameworks.
  • Disaster Recovery Mastery: Design and participate in regular DR drills, implementing blue/green and active/passive strategies across regions to ensure service continuity.
  • Innovate Operations: Proactively apply AI-driven approaches to improve operational efficiency and automated bottleneck detection.

Requirements

What you’ll need
  • Production K8s Mastery: Strong hands-on experience managing Kubernetes (GKE preferred) in high-load, multi-cluster production environments.
  • Cloud Infrastructure: Deep experience with GCP (AWS is a strong plus) and Terraform for large-scale infrastructure.
  • GitOps Expertise: Solid experience with ArgoCD, GitLab CI, and the 'Infrastructure as Code' philosophy.
  • Observability Expert: Deep knowledge of the Prometheus/Grafana stack and implementing tracing/logging at scale.
  • System Design: Proven ability to design highly available 24/7 systems with automated failover and rollback capabilities.
  • English Fluency: English level B2+ for effective cross-functional communication.

Benefits

Comp & perks
  • Make a genuine impact on the product
  • Join our upward trajectory, and grow with us.
  • Work in the EU
  • Enjoy the flexibility of traveling and working remotely or in a hybrid model across Europe.
  • Become a stock options holder
  • Unlock your inner entrepreneur and align your aspirations with ours through our Stock Options Program.
  • Receive unwavering support and care
  • Constant support and care to ensure your Finom experience is successful and fulfilling.
  • Work & Swim program
  • Spend one month in a comfortable corporate apartment in enchanting Cyprus.
  • Equal Opportunity Statement
  • We embrace diversity and invite applications from all walks of life.

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
KubernetesGKEGitOpsArgoCDCI/CDGitLabTerraformPrometheusOpenTelemetryGrafana
Soft Skills
leadershipcommunicationcollaborationproblem-solvinginnovation