Natera

Senior Data Engineer, Platform & Pipelines

Natera

full-time

Posted on:

Location Type: Remote

Location: United States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $125,000 - $155,000 per year

Job Level

About the role

  • Architect, implement, and maintain data ingestion and transformation pipelines using modern workflow orchestration tools (e.g. Dagster).
  • Identify, catalog, and integrate internal and external data sources used across research efforts.
  • Operationalize bioinformatics pipelines that support large-scale batch processing, incremental updates, and backfills within AWS.
  • Normalize and structure heterogeneous data into consistent, reusable representations that support downstream analysis, modeling, and querying.
  • Populate and maintain patient-centric data models in shared storage systems (e.g., graph and relational databases).
  • Collaborate with backend and AI engineers to design data-access patterns that support analytics applications and AI-driven interactions.
  • Contribute to backend services and APIs that expose integrated data to internal tools and applications.
  • Participate in the evolution of AI-enabled analysis workflows, including tooling that supports LLM- or agent-based interactions with data.
  • Contribute to system-level design decisions around data flow, service boundaries, reliability, and scalability.
  • Write clean, tested, and well-documented Python code that meets production software engineering standards.
  • Debug and resolve complex data quality, pipeline, backend, and infrastructure issues in a distributed environment.

Requirements

  • BS in Computer Science, Bioinformatics, Computational Biology, or a related field, MS preferred.
  • 4+ years of experience in production data engineering or software engineering.
  • Independently drive technical solutions from high-level goals, exercising judgment in system design, implementation, and tradeoff evaluation.
  • Strong proficiency in Python, with experience writing maintainable, production-quality code across data and backend contexts.
  • Extensive experience with software engineering fundamentals, design patterns, version control, CI/CD, Docker, and automated testing.
  • Experience designing and operating workflow orchestration systems (Dagster preferred; Airflow, Prefect, or similar acceptable).
  • Experience building or contributing to backend services (e.g., FastAPI or similar frameworks).
  • Hands-on experience with AWS services commonly used in data and backend systems (e.g., S3, ECS, Batch, Lambda).
  • Experience deploying and operating large-scale data or bioinformatics pipelines in AWS, including managing throughput, cost, and operational reliability.
  • Experience with relational databases (Postgres, MySQL) and/or graph databases (Neo4j), including schema and query design.
  • Experience contributing to system-level architecture, including data modeling, service boundaries, and operational robustness.
  • Ability to work effectively with scientists, bioinformaticians, and ML practitioners in an R&D environment.
Benefits
  • Comprehensive medical, dental, vision, life and disability plans for eligible employees and their dependents.
  • Free testing for Natera employees and their immediate families in addition to fertility care benefits.
  • Pregnancy and baby bonding leave.
  • 401k benefits.
  • Commuter benefits.
  • Generous employee referral program!
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Pythondata engineeringbioinformaticsworkflow orchestrationCI/CDDockerbackend servicesrelational databasesgraph databasesdata modeling
Soft Skills
collaborationindependent problem solvingjudgment in system designcommunication
Certifications
BS in Computer ScienceMS in Computer Science