Biohub

Staff Software Engineer – Data Infrastructure, AI Compute Platform

Biohub

full-time

Posted on:

Location Type: Hybrid

Location: Redwood CityCaliforniaUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $214,000 - $295,000 per year

Job Level

About the role

  • Develop and maintain the tooling and infrastructure that drives the entire data lifecycle at Biohub, from ingestion and processing to secure storage and access.
  • Partner with researchers and engineers across genetics, imaging, and literature, ensuring data accessibility and performance.
  • Design and implement flexible, scalable, and performant systems leveraging technologies like Argo Workflows, Slurm, Ray, and AWS Parallel Cluster.

Requirements

  • BS, MS, or PhD in Computer Science or a related technical discipline, or equivalent experience.
  • 8+ years of hands-on coding experience in scripting (Python, PHP, Ruby) and systems languages (Rust, C++, C#, Go, Java, or Scala).
  • Proficiency in managing large-scale data operations, including designing scalable pipelines (streaming and batch).
  • Experience with data governance, metadata, and data lineage tooling like Open Lineage or Marquez.
  • Experience with CI/CD pipelines for data infrastructure and monitoring tooling such as Prometheus, Grafana, OpenTelemetry, or Honeycomb.
  • Experience with addressing end-to-end data needs for model training, working with AI Researchers and AI Engineers.
  • Extensive experience with scaling containerized applications on Kubernetes or Mesos.
  • Strong experience with AWS, GCP, or Azure.
Benefits
  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonPHPRubyRustC++C#GoJavaScaladata pipeline design
Certifications
BS in Computer ScienceMS in Computer SciencePhD in Computer Science