
Senior AI Data Engineer
IQVIA
full-time
Posted on:
Location Type: Office
Location: Madrid • Spain
Visit company websiteExplore more
Job Level
Tech Stack
About the role
- Lead the development and optimization of data infrastructure supporting Agentic AI initiatives.
- Collaborate with ML engineers, AI scientists, and product managers to architect, implement, and maintain robust data pipelines.
- Design, develop, and maintain scalable data pipelines and ETL processes supporting AI research and development.
- Monitor and troubleshoot data pipeline issues to ensure continuity and reliability.
- Drive data platform reliability, scalability, and cost optimization across cloud-based infrastructure.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field; advanced degree preferred.
- 5+ years of professional experience in data engineering, including at least 2 years focused on ML/AI data infrastructure.
- Advanced proficiency in Python and Scala; experience with Rust, Go, Java, or Julia is valued.
- Expert-level knowledge of SQL and NoSQL databases.
- Hands-on experience with vector databases (e.g., Pinecone, Weaviate, Milvus).
- Proficiency with modern data orchestration platforms (e.g., Airflow 2.x).
- Extensive experience with at least one major cloud platform (AWS, Azure, or GCP).
- Expertise in containerization and orchestration (Docker, Kubernetes).
- Experience with Infrastructure as Code tooling (e.g., Terraform).
- Experience with distributed computing frameworks (Spark, Dask, Ray).
- Proficiency with streaming technologies (Kafka, Flink).
- Knowledge of modern data lakehouse architectures.
- Certifications in cloud platforms, big data technologies, engineering, or ML operations preferred.
Benefits
- Professional development opportunities
- Flexible working arrangements
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonScalaSQLNoSQLETLdata pipelinesdata engineeringcloud infrastructurecontainerizationdistributed computing