Clean, transform, and prepare large, complex healthcare datasets for machine learning model development (missing values, outlier detection, feature engineering, normalization).
Identify, collect, and curate relevant industry-specific datasets and format data for LLM training pipelines.
Design, train, and fine-tune various LLMs on extensive healthcare data to solve clinical or operational problems.
Set up and manage training environments, including GPU instances and required software; optimize hyperparameters and memory usage.
Integrate structured and unstructured data for multi-modal/multi-input models.
Evaluate model performance using appropriate metrics and implement optimization strategies.
Develop and maintain robust, scalable data and ML pipelines for training, inference, and deployment.
Collaborate with data scientists, clinicians, and software engineers to integrate models into production and ensure data privacy and security compliance.
Stay up-to-date with advancements in machine learning and healthcare AI and explore new technologies.
Maintain clear documentation of models, data pipelines, and experimental results.
Requirements
Education: Bachelor's or Master's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related quantitative field.
5+ years of experience in Machine Learning Engineering or a similar role.
Proven experience with large-scale data preprocessing, LLM/model training, and fine-tuning.
Experience with distributed training (PyTorch Distributed, DeepSpeed, Ray, Hugging Face Accelerate).
Experience with GPU/TPU optimization and memory management for large language models.
Experience working with healthcare data is highly desirable.
Proficiency in Python and relevant ML libraries (TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy).
Strong understanding of various machine learning algorithms, Large Language Models, and deep learning architectures.
Experience with cloud platforms (GCP, AWS) and distributed computing frameworks (Spark) is a plus.
Familiarity with MLOps practices and tools.
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Ability to work independently and as part of a team in a fast-paced environment.
Benefits
Competitive salary and benefits package.
Flexible working arrangements (remote or hybrid options available).
The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
machine learninglarge language modelsdata preprocessingfeature engineeringhyperparameter optimizationmodel evaluationdata pipelinesdeep learningPythonMLOps
Soft skills
problem-solvinganalytical skillscommunicationcollaborationindependenceteamworkadaptabilityattention to detailtime managementcreativity