Salary
💰 $180,000 - $220,000 per year
Tech Stack
ApachePySparkPythonSparkSQL
About the role
- Arbital Health centralizes, measures, and adjudicates value-based care contracts at scale
- Design, train, and test LLMs, RAG frameworks, and ML models for high-value use cases
- Provide cross-functional support to design, validate, and scale AI-driven capabilities on a mission-critical platform
- Analyze large healthcare datasets (claims, revenue, eligibility) to support client engagements and product strategy
- Maintain and enhance feature variable library and grouper logic
- Ensure secure and compliant handling of regulated data (e.g., PHI, HIPAA, financial data)
- Provide clear, actionable bug reports and support QA efforts
- Maintain well-documented, reproducible codebases
- Provide guidance for ethical use of GenAI and machine learning models
- Create and own update processes for model refreshes
- Maintain cleaned and enriched benchmark and de-identified datasets for various applications
Requirements
- Strong proficiency in Python or R
- Hands-on experience with >1M row, regulated data sets (healthcare, finance, etc.)
- Deep experience with generative AI, LLMs, and machine learning models (training, fine-tuning, or evaluation, and deployment)
- Thrives in a collaborative environment, loves to take ownership and accountability
- 5+ years in data science, machine learning, or AI-related roles
- Experience with distributed computing frameworks; familiarity or proficiency in Apache Spark (PySpark, Spark SQL) is a plus
- Proven track record of success in dynamic, fast-paced environments
- Excellent communication, collaboration, and problem-solving skills
- Experience using Git/GitHub for version control and collaborative analytics is a plus.