Feedzai

Senior Data Engineer – Product

Feedzai

full-time

Posted on:

Location Type: Remote

Location: Remote • 🇵🇹 Portugal

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

ApacheCloudETLHadoopHDFSJavaKubernetesLinuxPythonScalaSparkYarn

About the role

  • Re-architect and scale existing big data processing components powering DSF.
  • Analyse workload patterns (Spark jobs, notebook activity, DS API usage) and drive performance, reliability, and cost improvements.
  • Ensure stability of Spark jobs running on EMR or Kubernetes clusters.
  • Operate and evolve Hadoop ecosystem components (HDFS, YARN) and Spark runtimes.
  • Maintain and improve ingestion pipelines between Runtime and DSF (Firehose, Glue → S3).
  • Improve the developer and power-user experience across JupyterLabs and the DS API.
  • Collaborate with product engineers, data scientists, and platform teams on DSF roadmap execution.
  • Own services throughout their lifecycle following DevOps practices (“you build it, you run it”).

Requirements

  • 5+ years of experience building and operating distributed big data systems
  • Strong experience with Apache Spark - tuning, debugging, orchestration
  • Strong programming fundamentals (Java required; Scala or Python a plus)
  • Solid knowledge of the Hadoop ecosystem (HDFS, YARN)
  • Experience operating Linux-based systems in cloud environments
  • Familiarity with JupyterHub/JupyterLabs workflows
  • Experience designing and operating ETL/ELT pipelines
  • Comfort with continuous delivery, monitoring, and on-call responsibilities
  • Ability to work autonomously on complex technical challenges
Benefits
  • Health insurance and wellness programs
  • Professional development opportunities
  • Flexible working hours
  • Remote work options

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Apache SparkJavaScalaPythonHadoopHDFSYARNETLELTDevOps
Soft skills
autonomyproblem-solvingcollaborationcommunicationperformance analysisreliability improvementcost optimizationdeveloper experience enhancementtechnical challenge resolutionworkload analysis