Tech Stack
Airflow, BigQuery, Cloud, Python, SQL
About the role
- Design, build, and automate batch data pipelines that ingest from files, APIs, and SFTP into BigQuery and product environments.
- Implement scheduling, monitoring, retries, and backfills to ensure reliable and repeatable workflows (a minimal orchestration sketch follows this list).
- Establish guardrails such as schema management, versioning, and basic SLAs for data freshness and reliability.
- Productionize ML/AI batch jobs and publish outputs into analytics-ready tables.
- Maintain and refresh healthcare reference datasets (e.g., NPI, codesets, CMS lists) on schedule.
- Document pipelines clearly and make outputs consumable for analytics and BI teams.
- Handle PHI with care and follow HIPAA-aligned data governance practices.
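The scheduling, retry, and backfill responsibilities above map naturally onto an orchestrator such as Airflow. The sketch below is illustrative only, assuming Airflow 2.x with the Google Cloud provider installed; the DAG name, bucket, project, dataset, and table are placeholder values, not details of this role.

```python
# Minimal Airflow 2.x sketch: a daily batch load from GCS into BigQuery
# with retries, alerting, and catchup enabled for historical backfills.
# All resource names below are illustrative placeholders.
from datetime import datetime, timedelta

from airflow import DAG
from airflow.providers.google.cloud.transfers.gcs_to_bigquery import (
    GCSToBigQueryOperator,
)

default_args = {
    "retries": 3,                                 # automatic retries on failure
    "retry_delay": timedelta(minutes=10),
    "email_on_failure": True,                     # basic failure alerting
}

with DAG(
    dag_id="example_daily_file_to_bq",            # hypothetical DAG name
    start_date=datetime(2024, 1, 1),
    schedule="0 6 * * *",                         # daily run at 06:00 UTC
    catchup=True,                                 # allows backfilling past dates
    default_args=default_args,
    tags=["batch", "bigquery"],
) as dag:
    load_to_bq = GCSToBigQueryOperator(
        task_id="load_csv_to_bigquery",
        bucket="example-landing-bucket",          # placeholder bucket
        source_objects=["exports/{{ ds }}/*.csv"],  # files partitioned by run date
        destination_project_dataset_table="example_project.analytics.daily_facts",
        source_format="CSV",
        write_disposition="WRITE_TRUNCATE",       # idempotent reload per run date
        autodetect=True,                          # or supply an explicit schema
    )
```

Writing each run into its own date partition with an idempotent write disposition is one common way to make retries and backfills safe to repeat.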
Requirements
- 2+ years of experience building batch data workflows using Python and SQL, publishing to cloud data warehouses (BigQuery preferred).
- Proficiency with modern schedulers/orchestrators (e.g., Airflow, Prefect, Dagster) and containerized environments.
- Experience ingesting data from APIs, files, and SFTP sources, including schema evolution management.
- Strong debugging, monitoring, and CI/CD fundamentals.
- Excellent documentation and communication skills with an ownership mindset.
- Bonus: Experience with dbt, data quality testing, or healthcare data formats (claims, EHR, CMS datasets).
- Bonus: Familiarity with running ML jobs in production.