Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
H1

Staff Data Engineer

H1

Staff Data Engineer on the Real World Evidence team driving data initiatives at H1. Focused on data architecture and large-scale pipeline execution for healthcare systems.

Posted 6/3/2026full-timeNew York City • New York • 🇺🇸 United StatesLead💰 $170,000 - $190,000 per yearWebsite

Tech Stack

Tools & technologies
AirflowAWSCloudKafkaKubernetesPySparkPythonSparkSQL

About the role

Key responsibilities & impact
  • Act as a self-starter who drives execution independently, taking ownership and initiative with minimal need for day-to-day direction.
  • Lead high-visibility RWE projects, starting with claims data, and keep multiple initiatives moving by proactively unblocking teams.
  • Own the end-to-end architecture for critical data assets, ensuring solutions are scalable, reliable, and aligned with H1’s long-term vision.
  • Design, build, and optimize large-scale data pipelines (hundreds of TBs) for performance, reliability, and cost efficiency.
  • Partner with Product, Data Science, and downstream engineering teams to align priorities, manage dependencies, and deliver high-value outcomes.
  • Represent engineering in cross-functional forums, shaping roadmaps and reducing reliance on senior leadership for day-to-day decisions.
  • Develop deep domain expertise and mentor other engineers, helping raise the technical bar and influence the evolution of our data products.

Requirements

What you’ll need
  • 8+ years as a software, data, or backend engineer building and operating scalable, production-grade systems.
  • Experience with large-scale data processing (e.g., Spark/PySpark on EMR or similar) or scalable distributed backend systems, with the ability to quickly deepen expertise in our data stack (PySpark, EMR, Hudi/Delta).
  • Strong proficiency in SQL, including writing and optimizing complex queries over large datasets.
  • Strong programming experience in Python (or a modern language with the ability to quickly ramp up in Python).
  • Experience designing systems or large-scale datasets/pipelines with attention to performance, reliability, and maintainability.
  • Hands-on experience with modern engineering workflows and tooling such as Git, JIRA, and CI/CD systems (e.g., CircleCI).
  • Comfort deploying and troubleshooting distributed workloads in cloud environments such as AWS EMR or Kubernetes.
  • Experience with workflow orchestration or job scheduling tools (e.g., Airflow, Argo).
  • Demonstrated ability to independently drive complex, cross-team technical initiatives and influence stakeholders without formal authority.
  • Experience with streaming/messaging technologies (e.g., Kafka, Kinesis) nice to have
  • Background in RWE, healthcare data, or other complex/regulated data domains is preferred
  • Experience using AI-assisted coding tools (e.g., GitHub Copilot, Claude Code) to accelerate development while maintaining quality is encouraged

Benefits

Comp & perks
  • Full suite of health insurance options, in addition to generous paid time off
  • Pre-planned company-wide wellness holidays
  • Retirement options
  • Health & charitable donation stipends
  • Impactful Business Resource Groups
  • Flexible work hours & the opportunity to work from anywhere

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
large-scale data processingSparkPySparkSQLPythondata pipelinesworkflow orchestrationstreaming technologiesKubernetesAI-assisted coding tools
Soft Skills
self-starterownershipinitiativementoringinfluencing stakeholderscross-team collaborationproblem-solvingcommunicationtechnical leadershipindependent execution