
Data Engineer
Ansible Health
full-time
Location Type: Remote
Location: Remote • New York • 🇺🇸 United States
Job Level
Mid-Level, Senior
Tech Stack
Airflow, AWS, Azure, Cloud, Docker, ETL, Google Cloud Platform, Kafka, Kubernetes, MySQL, NoSQL, Postgres, Python, SQL
About the role
- Design, develop, and maintain scalable data pipelines to ingest, process, and integrate data from multiple healthcare systems, medical devices, and external sources
- Build and optimize data models and architectures that support clinical analytics, machine learning applications, and operational reporting
- Implement robust data quality controls, validation procedures, and monitoring systems to ensure data accuracy and reliability
- Create and maintain data documentation, including data dictionaries, lineage diagrams, and process workflows
- Collaborate with data scientists to provide efficient data access patterns and support the deployment of ML models into production
- Develop APIs and data services that enable secure, compliant access to healthcare data for internal teams and external partners
- Implement data security and privacy measures that ensure compliance with HIPAA and other healthcare regulations
Requirements
- Bachelor's or Master's degree in Computer Science, Information Systems, or related technical field
- 3+ years of experience in data engineering or similar roles
- Strong programming skills in Python and proficiency with SQL
- Experience with ETL tools and frameworks (Airflow, dbt, or similar)
- Hands-on experience with both relational databases (PostgreSQL, MySQL) and NoSQL technologies
- Familiarity with cloud platforms (AWS, GCP, or Azure) and their data services
- Knowledge of data modeling, warehouse design, and dimensional modeling concepts
- Understanding of data security best practices, particularly in sensitive data environments
- Experience working with healthcare data, particularly EHR systems, FHIR, or HL7 standards (preferred)
- Knowledge of HIPAA compliance requirements for data handling and processing (preferred)
- Familiarity with real-time data processing using technologies like Kafka or Kinesis (preferred)
- Experience with containerization (Docker) and orchestration (Kubernetes) (preferred)
- Understanding of data governance frameworks and implementation (preferred)
- Experience supporting machine learning workflows and model deployment (preferred)
- Knowledge of cardiopulmonary conditions or healthcare analytics (preferred)
Benefits
- Opportunity to make a significant impact on patient care for complex conditions and shape the future of digital health.
- A highly collaborative, mission-driven, and innovative work environment.
- Exposure to cutting-edge health technology, AI applications, and data practices.
- Competitive salary, equity options, and comprehensive benefits package.
- Significant opportunities for professional development and career growth.
- Work alongside experts from top clinical and technology institutions.