
Senior Data Engineer – Product
Feedzai
Full-time
Location Type: Remote
Location: Remote • 🇵🇹 Portugal
Job Level
Senior
Tech Stack
Apache, Cloud, ETL, Hadoop, HDFS, Java, Kubernetes, Linux, Python, Scala, Spark, YARN
About the role
- Re-architect and scale existing big data processing components powering DSF.
- Analyse workload patterns (Spark jobs, notebook activity, DS API usage) and drive performance, reliability, and cost improvements.
- Ensure stability of Spark jobs running on EMR or Kubernetes clusters.
- Operate and evolve Hadoop ecosystem components (HDFS, YARN) and Spark runtimes.
- Maintain and improve ingestion pipelines between Runtime and DSF (Firehose, Glue → S3).
- Improve the developer and power-user experience across JupyterLab and the DS API.
- Collaborate with product engineers, data scientists, and platform teams on DSF roadmap execution.
- Own services throughout their lifecycle following DevOps practices (“you build it, you run it”).
Requirements
- 5+ years of experience building and operating distributed big data systems
- Strong experience with Apache Spark: tuning, debugging, and orchestration
- Strong programming fundamentals (Java required; Scala or Python a plus)
- Solid knowledge of the Hadoop ecosystem (HDFS, YARN)
- Experience operating Linux-based systems in cloud environments
- Familiarity with JupyterHub/JupyterLab workflows
- Experience designing and operating ETL/ELT pipelines
- Comfort with continuous delivery, monitoring, and on-call responsibilities
- Ability to work autonomously on complex technical challenges
Benefits
- Health insurance and wellness programs
- Professional development opportunities
- Flexible working hours
- Remote work options
Applicant Tracking System Keywords
Hard skills
Apache Spark, Java, Scala, Python, Hadoop, HDFS, YARN, ETL, ELT, DevOps
Soft skills
autonomy, problem-solving, collaboration, communication, performance analysis, reliability improvement, cost optimization, developer experience enhancement, technical challenge resolution, workload analysis