Midi Health

Lead Data Engineer

Midi Health

full-time

Posted on:

Origin:  • 🇺🇸 United States • California

Visit company website
AI Apply
Apply

Salary

💰 $180,000 - $230,000 per year

Job Level

Senior

Tech Stack

Distributed SystemsETLPySparkPythonSQLTableau

About the role

  • Spearhead design, implementation, and iteration of a modern data infrastructure powering analytics, data science, and ML/AI systems at Midi Health
  • Define and execute the strategic roadmap for data infrastructure and analytics capabilities across the organization
  • Partner with Data Science, Operations Analytics, Engineering, and Product on design and implementation of scalable data pipelines, models, and solutions
  • Drive the development of foundational data products and tools to power self-service analytics
  • Actively contribute to and influence engineering processes, culture, practices, and systems
  • Serve as a technical thought leader across teams on data engineering best practices
  • Lead 0-1 data platform initiatives and help scale data systems at a rapidly growing startup

Requirements

  • 5+ years of experience
  • Experienced with the modern data engineering stack (dbt, pySpark, Fivetran, Snowflake, Lakehouse, CDP’s, ETL tools, etc.)
  • Advanced knowledge of SQL and Python
  • Deep expertise in data pipelines, distributed systems, and analytics infrastructure
  • Hands-on experience with Data Warehousing technologies, Data Lake architecture, and ETL pipelines and tools
  • Deep understanding of BI tooling infrastructure and semantic layer design (ex. Looker, Tableau, Metabase, Mode)
  • Experience and interest in leading major architecture initiatives from the ground up
  • Belief in applying best in class software engineering practices to data systems
  • Interest in coaching/mentoring junior engineers
  • Bonus (preferred): experience building data products that meet HIPAA requirements
  • Bonus (preferred): experience building platforms that support realtime and batch ML/AI products and systems
  • Bonus (preferred): experience integrating EHR and other complex 3rd party system data