Maintain and evolve our AWS infrastructure, ensuring high scalability and resilience for our multi-tenant US and EU environments and all our private VPCs.
Design, implement and manage infrastructure as code using Pulumi.
Improve monitoring and observability using Grafana and OpenTelemetry.
Collaborate closely with our engineers team to improve CI/CD pipelines and deployment speed.
Enhance infrastructure security, automation, and cost efficiency.
Contribute to incident response and continuous improvement of our reliability processes.
Requirements
5+ years of experience in DevOps, Site Reliability Engineering, or Cloud Infrastructure.
Deep expertise with AWS (ECS, EC2, Elastic Load Balancing, Cloudfront, VPC, IAM, CloudWatch, Lambda…).
Experience with Pulumi, Terraform, or another Infrastructure as Code tool.
Familiar with Grafana, Prometheus, and OpenTelemetry (or similar monitoring stacks).
Proven ability to manage scalable, multi-region environments and troubleshoot complex distributed systems.
Working knowledge of Azure or GCP is a plus.
Solid understanding of networking.
Comfortable with scripting (Bash, NodeJS) and Git-based CI/CD (GitHub Actions preferred).
Strong problem-solving skills, ownership mindset, and attention to detail.
Fluent in English, both spoken and written.
Able to work in the CET timezone (±1h) for effective collaboration.
Benefits
An exciting scale-up environment with real growth opportunities.
The chance to shape Luzmo’s infrastructure and reliability culture.
Collaboration with a passionate and experienced engineering team.
Competitive salary and benefits.
Flexible holiday policy.
Remote / hybrid working environment.
International company gatherings.
The equipment and tools you need to succeed.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.