
Site Reliability Engineer – Manager
Leadtech Group
full-time
Location Type: Remote
Location: Spain
About the role
- Lead a team of Site Reliability Engineers.
- Oversee the architecture, scalability, and maintenance of our cloud infrastructure, ensuring system reliability, security, and efficiency.
- Act as a player-coach, contributing hands-on to day-to-day engineering and operational tasks.
- Define and implement SRE best practices (such as CI/CD pipelines, Infrastructure as Code, automated failovers, chaos engineering, and blameless post-mortems) across all projects under your responsibility.
- Define and track crucial reliability metrics (SLIs, SLOs, SLAs, error budgets, MTTR, MTTD) to evaluate the health of our platforms and monitor performance and stability.
- Collaborate closely with the Product and Engineering teams to align the infrastructure roadmap with new products and iterations, balancing feature velocity with system reliability.
- Assess the technical feasibility of new projects and features, providing high-level estimates, capacity planning, and architectural recommendations.
- Enhance team unity, ensuring that everyone feels part of the project and the company, and championing a culture of reliability across the broader engineering organization.
Requirements
- You have previous experience in Site Reliability Engineering, systems engineering, or software engineering with an infrastructure focus.
- You have experience supporting multiple products with suites of cloud computing services such as Google Cloud and AWS.
- You know how to implement SRE best practices (Infrastructure as Code, observability, automation, incident management).
- You have experience with AI (e.g., utilizing AI-driven operations/AIOps or supporting AI-based infrastructure).
- You have proven leadership of SRE, DevOps, Platform, or Infrastructure teams, ensuring that they are aligned with engineering and work towards a common goal of reliability and scalability.
- You have previous experience managing people, delivery, system quality, and operational processes.
- You are an expert in promoting teamwork so that SREs collaborate seamlessly with software development teams, creating synergies for continuous improvement and shared ownership.
- You like to work in agile and dynamic environments.
- You have excellent communication and leadership skills.
- You are fluent in English. Spanish is a nice-to-have!
Benefits
- Growth and career development
- - At Leadtech, we prioritize your growth. Enjoy a flexible career path with personalized internal training and an annual budget for external learning opportunities.
- Work-Life balance
- - Benefit from a flexible schedule with flextime and the option of working fully remotely or from our Barcelona office. Enjoy free Friday afternoons with a 7-hour workday, plus a 35-hour workweek in July and August so you can savor summer!
- Comprehensive benefits
- - Competitive salary, full-time permanent contract, and top-tier private health insurance (including dental and psychological services).
- - 25 days of vacation plus your birthday off, with flexible vacation options—no blackout days!
- Unique Perks
- - If you choose to come in, our Barcelona office is complete with free coffee, fresh fruit, snacks, a game room, and a rooftop terrace with stunning Mediterranean views.
- - Additional benefits include restaurant tickets and nursery vouchers, paid directly from your gross salary.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Site Reliability Engineering, Infrastructure as Code, CI/CD pipelines, automated failovers, chaos engineering, reliability metrics, SLIs, SLOs, SLAs, AIOps
Soft Skills
leadership, teamwork, communication, collaboration, capacity planning, problem-solving, agility, culture of reliability, people management, delivery management