Phastar

Principal Data Scientist

Phastar

full-time

Posted on:

Location Type: Remote

Location: United Kingdom

Visit company website

Explore more

AI Apply
Apply

Job Level

Tech Stack

About the role

  • Develop LLM-powered automation tools for clinical reporting and document generation workflows
  • Build AI-driven code generation pipelines and quality assessment frameworks
  • Design and implement human-in-the-loop review workflows with feedback loops to continuously improve output quality
  • Design and build multi-agent systems for data workflows — agents that retrieve, generate, validate, and iterate autonomously
  • Implement agent orchestration using frameworks such as Google ADK, LangGraph, or LangChain
  • Deploy and manage agents on Google Vertex AI
  • Use AI coding tools (e.g. Gemini CLI, GitHub Copilot) as a core part of your development workflow
  • Critically review and validate AI-generated code — understanding what it produces, why, and when it’s wrong
  • Write effective prompts to direct AI tools toward correct, secure, well-structured output

Requirements

  • Strong Python skills
  • Hands-on experience building RAG systems or LLM-powered applications (using LangChain, LlamaIndex, or similar frameworks)
  • Experience integrating LLM APIs (Google Gemini, OpenAI, or similar) — we work primarily through Google Vertex AI
  • Working knowledge of vector databases (ChromaDB, Weaviate, Qdrant, Pinecone, or similar)
  • Docker and containerized deployments
  • Comfortable using AI-assisted development tools (e.g. Gemini CLI, GitHub Copilot) — and critically evaluating what they produce
Benefits
  • Professional development opportunities
  • Flexible working hours
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonRAG systemsLLM-powered applicationsLLM APIsvector databasesDockercontainerized deploymentsAI coding toolsprompt engineeringcode validation