
Site Reliability Engineer – Storage
Scaleway
Full-time
Location Type: Hybrid
Location: Paris • France
About the role
- Continuously improve the reliability and scalability of our platforms.
- Automate infrastructure to optimize deployments and reduce human intervention.
- Collaborate with Dev, Product and Ops teams to ensure high-performing, resilient services.
- Develop tools and frameworks to streamline deployments and infrastructure management.
- Automate repetitive tasks to improve efficiency and reliability.
- Establish key metrics (SLOs, KPIs) to monitor service performance.
- Optimize monitoring and alerting systems to minimize alert fatigue.
- Identify, diagnose and quickly resolve production incidents.
- Analyze root causes and implement preventive measures.
- Apply best practices (fault tolerance, load balancing, redundancy) to strengthen system resilience.
- Optimize resource usage to reduce energy consumption and improve performance.
- Work closely with Dev & Product teams to integrate reliability by design.
- Participate in architecture reviews and share SRE best practices.
Requirements
- Experience with Infrastructure as Code (IaC) and CI/CD.
- Proficiency with monitoring and logging tools.
- Strong knowledge of Linux systems and production troubleshooting.
- Ability to work in English and to collaborate effectively in a team.
- Development experience (Go, Rust).
- Focus on developer experience and an aptitude for coaching.
- Experience with distributed storage (S3, CephFS, ZFS).
Benefits
- A cutting-edge technical environment with exciting challenges.
- A culture of innovation and knowledge sharing where expertise and creativity are encouraged.
- A strong commitment to a more responsible cloud, with eco-designed data centers.
- Onboarding support including a tour of our offices and opportunities to meet your future colleagues.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Infrastructure as Code, CI/CD, Linux systems, Go, Rust, monitoring tools, logging tools, distributed storage, S3, CephFS
Soft skills
collaboration, coaching, teamwork, problem-solving, communication