Advarra

Senior Data Scientist

Advarra

full-time

Posted on:

Location Type: Remote

Location: United States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $91,524 - $167,794 per year

Job Level

Tech Stack

About the role

  • Focus on understanding existing models, assessing their performance, selecting optimal architectures, and fine-tuning them to meet specific domain and business needs—including retrieval-augmented generation (RAG) based applications.
  • Collaborate closely with data engineering, product, and domain teams to translate real-world research challenges into scalable, model-driven solutions that accelerate Advarra’s vision of a digitally connected research data and technology fabric.
  • Optimize and fine-tune large language models (LLMs) and domain-specific variants using proprietary datasets to achieve precision and recall targets that drive differentiated customer value.
  • Evaluate model performance across key metrics and benchmarks, identifying strengths, weaknesses, and opportunities for improvement across predictive, generative, and retrieval-augmented tasks.
  • Implement and operationalize LLM-based and retrieval-augmented (RAG) systems that enhance Braid-powered products such as Study Design and Site Feasibility.
  • Collaborate with data engineering to ensure scalable, efficient model training, evaluation, and deployment pipelines using Databricks, MLflow, and Delta Lake.
  • Assess and select models—open-source or proprietary—that best align with domain-specific requirements and Advarra’s regulated research environment.
  • Partner with clinical and operational experts to translate research and trial challenges into measurable model evaluation frameworks and optimization strategies.
  • Conduct model interpretability and bias analyses to ensure fairness, transparency, and compliance with governance standards.
  • Document methodologies and validation results to support internal governance, reproducibility, and audit readiness.
  • Contribute to reusable fine-tuning workflows, evaluation frameworks, and model monitoring pipelines within the Braid AI stack.
  • Stay at the forefront of advancements in LLM optimization, retrieval augmentation, and multi-modal learning, applying new methods that improve scalability, explainability, and cost efficiency

Requirements

  • MS in Machine Learning, Computer Science, or related quantitative discipline, or equivalent relevant work experience.
  • 5+ years of hands-on experience developing and fine-tuning ML or LLM models
  • Demonstrated expertise in Python, with experience and knowledge of a commercial framework like PyTorch.
  • Hands-on experience developing, managing, and troubleshooting workflows within Databricks for data engineering, analytics, and machine learning projects
  • Documented strong understanding of the ML lifecycle
  • Experience with embeddings and retrieval-augmented generation (RAG)
Benefits
  • health coverage
  • paid holidays
  • variable bonus
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
machine learninglarge language modelsmodel fine-tuningmodel evaluationembeddingsretrieval-augmented generationPythonPyTorchML lifecyclemodel interpretability
Soft Skills
collaborationproblem-solvingcommunicationanalytical thinkingorganizational skills
Certifications
MS in Machine LearningMS in Computer Science