Leadtech Group

Site Reliability Engineer – Manager

full-time

Location Type: Remote

Location: Spain

About the role

  • Lead a team of Site Reliability Engineers.
  • Oversee the architecture, scalability, and maintenance of our cloud infrastructure, ensuring system reliability, security, and efficiency.
  • Act as a player-coach, contributing hands-on to day-to-day engineering and operational tasks.
  • Define and implement SRE best practices (such as CI/CD pipelines, Infrastructure as Code, automated failovers, chaos engineering, and blameless post-mortems) across all projects under your responsibility.
  • Define and track key reliability metrics (SLIs, SLOs, SLAs, error budgets, MTTR, MTTD) to evaluate the health of our platforms and monitor performance and stability.
  • Collaborate closely with the Product and Engineering teams to align the infrastructure roadmap with new products and iterations, balancing feature velocity with system reliability.
  • Assess the technical feasibility of new projects and features, providing high-level estimates, capacity planning, and architectural recommendations.
  • Strengthen team cohesion, ensuring that everyone feels part of the project and the company, and champion a culture of reliability across the broader engineering organization.

Requirements

  • You have previous experience in Site Reliability Engineering, systems engineering, or software engineering with an infrastructure focus.
  • You have experience supporting multiple products on cloud platforms such as Google Cloud and AWS.
  • You know how to implement SRE best practices (Infrastructure as Code, observability, automation, incident management).
  • You have experience with AI (e.g., utilizing AI-driven operations/AIOps or supporting AI-based infrastructure).
  • You have proven experience leading SRE, DevOps, Platform, or Infrastructure teams, ensuring that they are aligned with engineering and work towards a common goal of reliability and scalability.
  • You have previous experience managing people, delivery, system quality, and operational processes.
  • You are an expert in promoting teamwork so that SREs collaborate seamlessly with software development teams, creating synergies for continuous improvement and shared ownership.
  • You like to work in agile and dynamic environments.
  • You have excellent communication and leadership skills.
  • You are fluent in English. Spanish is a nice-to-have!

Benefits

  • Growth and career development
    - At Leadtech, we prioritize your growth. Enjoy a flexible career path with personalized internal training and an annual budget for external learning opportunities.
  • Work-life balance
    - Benefit from a flexible schedule with flextime and the option of working fully remotely or from our Barcelona office. Enjoy free Friday afternoons with a 7-hour workday, plus a 35-hour workweek in July and August so you can savor summer!
  • Comprehensive benefits
    - Competitive salary, full-time permanent contract, and top-tier private health insurance (including dental and psychological services).
    - 25 days of vacation plus your birthday off, with flexible vacation options and no blackout days!
  • Unique perks
    - If you choose to come in, our Barcelona office comes complete with free coffee, fresh fruit, snacks, a game room, and a rooftop terrace with stunning Mediterranean views.
    - Additional benefits include restaurant tickets and nursery vouchers, paid directly from your gross salary.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Site Reliability Engineering, Infrastructure as Code, CI/CD pipelines, automated failovers, chaos engineering, reliability metrics, SLIs, SLOs, SLAs, AIOps
Soft Skills
leadership, teamwork, communication, collaboration, capacity planning, problem-solving, agility, culture of reliability, people management, delivery management