Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Biohub

Data Engineer

Biohub

Data Engineer designing systems that ingest and transform data for biological AI at Biohub. Building infrastructure for training datasets to enhance AI capabilities in biological research.

Posted 5/28/2026full-timeNew York City • California, New York • 🇺🇸 United StatesSeniorLead💰 $241,000 - $338,000 per yearWebsite

Tech Stack

Tools & technologies
AWSCloudRaySpark

About the role

Key responsibilities & impact
  • Design and build data pipelines that process genomic and imaging data at petabyte scale
  • Solve performance and bandwidth challenges with creative engineering
  • Build agent-based systems for automated dataset curation, quality control, and workflow generation
  • Create tooling for data cataloging and registration that makes datasets discoverable and accessible
  • Collaborate with AI Research teams to translate model requirements into data specifications, and with our scientists to integrate public and internal data into large-scale ai-ready datasets
  • Improve pipeline reliability and observability, working toward 99%+ success rates without manual intervention

Requirements

What you’ll need
  • 8+ years experience building reliable, operable data systems at scale (100s terabytes to petabytes)
  • Strong software engineering fundamentals
  • Experience deploying distributed computing frameworks like Databricks, Spark, or Ray for large-scale data processing
  • Experience with cloud infrastructure (AWS preferred) and HPC environments
  • Comfort with ambiguity; ability to make progress when requirements are evolving
  • Interest in AI-native development practices and tooling
  • Nice to have: Background in computational biology, bioinformatics, or life sciences and experience with genomics datasets and formats (FASTQ, BAM, VCF) or imaging formats (OME-Zarr, HDF5)

Benefits

Comp & perks
  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
data pipelinesgenomic data processingimaging data processingdistributed computing frameworksDatabricksSparkRaycloud infrastructureHPC environmentsAI-native development
Soft Skills
problem solvingcreativitycollaborationadaptabilitycommunication