
Senior Observability Engineer
FARFETCH
full-time
Posted on:
Location Type: Hybrid
Location: Porto • Portugal
Visit company websiteExplore more
Job Level
About the role
- Design and build robust observability infrastructure, focusing on the scalability, high availability, and performance of our core telemetry platforms.
- Manage large-scale data pipelines and storage, ensuring the reliable ingestion, processing, and querying of Logs, Metrics, and Traces using technologies like OpenTelemetry (OTEL), Fluentbit, the Victoria ecosystem (Metrics, Logs, Traces).
- Drive the creation of self-service capabilities, empowering engineering teams and FARFETCH Business Units to be autonomous in their analysis, detection, and investigation efforts.
- Ensure the reliability and performance of our internal observability stack, making sure systems like Grafana, and various exporters remain robust even during high-traffic events.
- Contribute to technical strategy and roadmaps, identifying opportunities to optimize cost efficiency, storage, and reliability through modern infrastructure patterns.
- Collaborate and support the integration of Observability Frameworks (Audit, Logging, Monitoring, etc), ensuring they fit seamlessly into the broader platform architecture.
- Provide technical mentorship and guidance to other engineers, fostering a culture of technical excellence, psychological safety, and continuous learning within the team.
Requirements
- You have a proven track record of designing, building, and maintaining complex observability platforms and distributed systems at scale.
- You possess a strong technical background in telemetry infrastructure, with hands-on experience managing tools like OpenTelemetry, Fluentbit, Victoria Metrics/Logs, Grafana.
- You have solid experience with cloud infrastructure and container orchestration (Kubernetes), allowing you to make and guide complex architectural decisions regarding the observability stack.
- You understand the software development lifecycle deeply and have experience with development languages (e.g., Golang or Python) to effectively build tools, exporters, or automations.
- You have strong analytical and problem-solving skills, with a relentless focus on continuous system improvement, infrastructure as code, and automation.
- You have good interpersonal and communication skills, with the ability to collaborate effectively and influence technical stakeholders across different Business Units.
- You prioritize data security, compliance, and cost-efficiency as core pillars of a reliable observability platform.
Benefits
- Health insurance for the whole family, flexible working environment and well-being support and tools
- Extra days off, sabbatical program and days for you to give back for the community
- Training opportunities and free access to Udemy
- Flexible benefits program
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
observability infrastructuredata pipelinestelemetry platformscloud infrastructurecontainer orchestrationGolangPythoninfrastructure as codeautomationscalability
Soft Skills
analytical skillsproblem-solving skillsinterpersonal skillscommunication skillscollaborationinfluencetechnical mentorshipcontinuous learningpsychological safetytechnical excellence