Tech Stack
ApacheAWSAzureBigQueryCloudGoogle Cloud PlatformJavaKafkaKubernetesMicroservicesPythonSparkSQL
About the role
- Join one of our Data Science Teams distributed between Paris, Barcelona and Brussels, providing forecasts to operational and business entities
- Support high-priority projects around product pricing, product ranking, warehouse efficiency and search
- Build and maintain data pipelines and ensure data quality in the context of machine learning projects
- Deploy, version and monitor machine learning models at scale on a mixed cloud/on-premise architecture
- Develop, deploy and maintain high-load APIs and microservices on Kubernetes to expose ML-based predictions to customer-facing products
- Identify, design and implement internal process improvements: automate manual processes, optimize code and data delivery, redesign infrastructure for scalability
- Contribute to and grow the Veepee data engineering community: attend conferences, contribute to open-source, organize and attend meetups
Requirements
- At least 5 years of experience in software engineering, preferably in the data field
- Strong knowledge of Java, SQL, and Python
- Experience with data processing technologies like Apache Beam (or Flink), Spark, Kafka, etc. is a plus
- Experience with distributed data systems: No-SQL databases (e.g. BigTable), Trino, data lakes (e.g. BigQuery), storage (e.g. GCS / S3)
- Used to work in a cloud environment (GCP, AWS, Azure,...)
- Familiar with the concepts of microservice architecture
- Prior experience with deploying containers on a platform like Kubernetes and its ecosystem is a big plus
- Proficiency with version control Git
- Experience with tools like dbt is a plus
- Interest in machine learning and data analytics
- A strong team player, willing to share knowledge with other team members and help out where needed
- Strong verbal and written English language skills.