Salary
💰 $75,000 - $130,000 per year
Tech Stack
AirflowAWSCloudJavaPythonSparkSQL
About the role
- Build and maintain data processing pipeline and tools using state-of-the-art technologies.
- Work with Python on Spark-based data pipelines.
- Develop algorithms to build complex data relationships.
- Build analytical data structures to support reporting.
- Build and maintain Data Quality processes.
- Collaborate with Product team to adapt our reference data to changing demands in the market.
Requirements
- 3+ years of experience developing data pipelines using cloud-managed Spark clusters (e.g. AWS EMR, Databricks)
- Fluent in Python or Java and Spark (3+ years of experience)
- Previous experience building tools and libraries to automate and streamline data processing workflows.
- Proficient with SQL / SparkSQL
- Hands-on experience working with a Data Lakehouse.
- Good verbal and written communication and proven experience of working and delivering in an Agile environment.
- Veeva is not sponsoring H1B or supporting H1 transfers for this role.