Build and curate datasets for training and evaluating our AI models.
Build and maintain robust evaluation pipelines to rigorously measure and continuously improve AI performance.
Prompt, evaluate, and iterate on LLM workflows—retrieval, agent logic, tool use—to ensure they lead the industry in accuracy and reliability.
Own the deployment lifecycle—designing APIs, integrating model inference into our Python‑based backend, instrumenting observability, and monitoring performance in production.
Stay deeply connected to the latest developments in AI research and rapidly integrate relevant advancements.
Collaborate directly with product engineers, clinical leaders, and the CTO to ensure our AI directly meets clinical needs.
Exercise significant autonomy to choose projects and drive them from concept to completion.
Requirements
3+ years of experience working in applied AI or machine learning.
Strong hands-on experience with modern AI tools, LLMs, and evaluation methods.
Proficiency in Python with ability to integrate AI solutions into production backend systems.
Ability to independently identify impactful projects and proactively drive them from concept to completion. This role suits someone who thrives with autonomy and ownership rather than task-driven work.
Comfort rapidly learning new techniques and staying current in a fast-evolving AI landscape.
Bonus points: Experience in healthcare or health-tech environments.
Bonus points: Have previously worked at or founded a startup (founder mindset).