Build the next-gen data infrastructure for Zinnia using Lakehouse frameworks, Apache Airflow, Apache Spark, and Hive
Design, build, and optimize data workflows across real-time, nearline, and offline data ecosystems
Leverage Lakehouse platforms (Delta Lake, Hudi, Iceberg) to enable unified batch and streaming pipelines
Collaborate with stakeholders and cross-functional teams to understand business requirements and translate them into scalable, data-driven technical solutions
Provide technical expertise in troubleshooting and resolving complex, distributed data-related issues
Stay up to date with Big Data, cloud, and Lakehouse trends and recommend best practices for data engineering and integration
Mentor and guide junior engineers, fostering a culture of innovation, automation, and continuous learning
Requirements
Bachelor’s degree in computer science, Information Technology, or related field
10+ years of experience in Big Data engineering or a similar role, with proven leadership and project management experience
Strong expertise in data integration, transformation, and orchestration using Spark, Hive, and Airflow
Proficiency in Lakehouse platforms (Delta Lake, Apache Hudi, Apache Iceberg) and data warehousing concepts
Familiarity with cloud-based data environments (AWS, GCP, or Azure)
In-depth understanding of scalable data pipelines, distributed computing, and modern data architectures
Programming knowledge in Scala or Python
Knowledge of data quality and governance principles and experience implementing them within the Big Data lifecycle
Excellent communication, leadership, and interpersonal skills
Proven ability to adapt to changing priorities, manage multiple projects simultaneously, and deliver results in a fast-paced environment
Benefits
Health/dental insurance
Parental leave
401(k)
Incentive/bonus opportunity
Tuition reimbursement
Competitive compensation and excellent career progression
Great benefits and other unspecified perks
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
data integrationdata transformationdata orchestrationApache SparkApache HiveApache AirflowDelta LakeApache HudiApache IcebergScala