Tech Stack
ApacheAWSAzureBigQueryCloudGoogle Cloud PlatformJavaKafkaKubernetesMicroservicesPythonSparkSQL
About the role
- Join one of our Data Science Teams distributed between Paris, Barcelona and Brussels
- Build data pipelines and ensure data quality for machine learning projects
- Deploy machine learning models on a mixed cloud/on-premise architecture and version, deploy and monitor models at scale
- Leverage LLMs and develop microservices to expose ML-based predictions to customer-facing products
- Develop, deploy and maintain high-load APIs on Kubernetes with strong SLA requirements and high business value
- Identify, design, and implement internal process improvements: automation, code and data delivery optimization, infrastructure redesign for scalability
- Keep improving technical knowledge and expertise by animating the Veepee data engineering community, attending conferences, contributing to open-source projects, and organizing meetups.
Requirements
- At least 5 years of experience in software engineering, preferably in the data field
- Strong knowledge of Java, SQL, and Python
- Experience with data processing technologies like Apache Beam (or Flink), Spark, Kafka, etc. is a plus
- Experience with distributed data systems: No-SQL databases (e.g. BigTable), Trino, data lakes (e.g. BigQuery), storage (e.g. GCS / S3)
- Used to work in a cloud environment (GCP, AWS, Azure,...)
- Familiar with the concepts of microservice architecture
- Prior experience with deploying containers on a platform like Kubernetes and its ecosystem is a big plus
- Proficiency with version control Git
- Experience with tools like dbt is a plus
- Interest in machine learning and data analytics
- A strong team player, willing to share knowledge with other team members and help out where needed
- Strong verbal and written English language skills.