
Principal Data Scientist
Phastar
full-time
Posted on:
Location Type: Remote
Location: United Kingdom
Visit company websiteExplore more
Job Level
About the role
- Develop LLM-powered automation tools for clinical reporting and document generation workflows
- Build AI-driven code generation pipelines and quality assessment frameworks
- Design and implement human-in-the-loop review workflows with feedback loops to continuously improve output quality
- Design and build multi-agent systems for data workflows — agents that retrieve, generate, validate, and iterate autonomously
- Implement agent orchestration using frameworks such as Google ADK, LangGraph, or LangChain
- Deploy and manage agents on Google Vertex AI
- Use AI coding tools (e.g. Gemini CLI, GitHub Copilot) as a core part of your development workflow
- Critically review and validate AI-generated code — understanding what it produces, why, and when it’s wrong
- Write effective prompts to direct AI tools toward correct, secure, well-structured output
Requirements
- Strong Python skills
- Hands-on experience building RAG systems or LLM-powered applications (using LangChain, LlamaIndex, or similar frameworks)
- Experience integrating LLM APIs (Google Gemini, OpenAI, or similar) — we work primarily through Google Vertex AI
- Working knowledge of vector databases (ChromaDB, Weaviate, Qdrant, Pinecone, or similar)
- Docker and containerized deployments
- Comfortable using AI-assisted development tools (e.g. Gemini CLI, GitHub Copilot) — and critically evaluating what they produce
Benefits
- Professional development opportunities
- Flexible working hours
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonRAG systemsLLM-powered applicationsLLM APIsvector databasesDockercontainerized deploymentsAI coding toolsprompt engineeringcode validation