
Senior Data Scientist
Advarra
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Salary
💰 $91,524 - $167,794 per year
Job Level
About the role
- Focus on understanding existing models, assessing their performance, selecting optimal architectures, and fine-tuning them to meet specific domain and business needs—including retrieval-augmented generation (RAG) based applications.
- Collaborate closely with data engineering, product, and domain teams to translate real-world research challenges into scalable, model-driven solutions that accelerate Advarra’s vision of a digitally connected research data and technology fabric.
- Optimize and fine-tune large language models (LLMs) and domain-specific variants using proprietary datasets to achieve precision and recall targets that drive differentiated customer value.
- Evaluate model performance across key metrics and benchmarks, identifying strengths, weaknesses, and opportunities for improvement across predictive, generative, and retrieval-augmented tasks.
- Implement and operationalize LLM-based and retrieval-augmented (RAG) systems that enhance Braid-powered products such as Study Design and Site Feasibility.
- Collaborate with data engineering to ensure scalable, efficient model training, evaluation, and deployment pipelines using Databricks, MLflow, and Delta Lake.
- Assess and select models—open-source or proprietary—that best align with domain-specific requirements and Advarra’s regulated research environment.
- Partner with clinical and operational experts to translate research and trial challenges into measurable model evaluation frameworks and optimization strategies.
- Conduct model interpretability and bias analyses to ensure fairness, transparency, and compliance with governance standards.
- Document methodologies and validation results to support internal governance, reproducibility, and audit readiness.
- Contribute to reusable fine-tuning workflows, evaluation frameworks, and model monitoring pipelines within the Braid AI stack.
- Stay at the forefront of advancements in LLM optimization, retrieval augmentation, and multi-modal learning, applying new methods that improve scalability, explainability, and cost efficiency
Requirements
- MS in Machine Learning, Computer Science, or related quantitative discipline, or equivalent relevant work experience.
- 5+ years of hands-on experience developing and fine-tuning ML or LLM models
- Demonstrated expertise in Python, with experience and knowledge of a commercial framework like PyTorch.
- Hands-on experience developing, managing, and troubleshooting workflows within Databricks for data engineering, analytics, and machine learning projects
- Documented strong understanding of the ML lifecycle
- Experience with embeddings and retrieval-augmented generation (RAG)
Benefits
- health coverage
- paid holidays
- variable bonus
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
machine learninglarge language modelsmodel fine-tuningmodel evaluationembeddingsretrieval-augmented generationPythonPyTorchML lifecyclemodel interpretability
Soft Skills
collaborationproblem-solvingcommunicationanalytical thinkingorganizational skills
Certifications
MS in Machine LearningMS in Computer Science