Tech Stack
AWS, Cloud, Java, Kafka, NoSQL, Python, Scala, Spark, SQL
About the role
- Design and build ingestion pipelines for healthcare claim and clinical data using modern data engineering tools
- Perform unit testing, debugging, and peer reviews to ensure code quality and performance
- Collaborate with engineering and product teams to optimize data architecture and integration processes
- Contribute to Agile ceremonies (sprint planning, grooming, demos) and update progress in Jira
- Troubleshoot and improve legacy data connectors while developing new scalable solutions
- Support data quality validation and deliver clear documentation for internal teams
- Participate in cross-functional sessions to improve development processes and delivery speed
Requirements
- 5+ years of experience in data engineering roles
- Strong SQL skills and experience with relational database design
- Hands-on experience with Spark (preferably Scala)
- Working knowledge of NoSQL databases and cloud environments (AWS preferred)
- Experience with data ingestion tools and frameworks (e.g., Kafka, NiFi)
- Familiarity with Python and Java for data processing
- Bonus: Experience with healthcare datasets and analytics
- Strong analytical thinking and structured problem-solving skills
- Clear communication and ability to collaborate across teams
- Ownership mindset and ability to manage multiple priorities
- Interest in understanding the business value behind the data