Salary
💰 $136,000 - $170,000 per year
Tech Stack
NumpyPandasPythonPyTorchSQLTensorflow
About the role
- Datavant is a data platform company and the world’s leader in health data exchange. Datavant is looking for an enthusiastic and meticulous Data Scientist to join our growing team, which builds machine learning models for use across Datavant in multiple verticals and for multiple customer types.
- You will play a crucial role in developing new product features and automating existing internal processes to drive innovation across Datavant. You will work with tens of millions of patients’ worth of healthcare data to develop models, contributing to the entirety of the model development lifecycle from ideation and research to deployment and monitoring. You will collaborate with an experienced team of Data Scientists and Machine Learning Engineers along with application Engineers and Product Managers across the company to achieve Datavant’s AI-enabled future.
- You Will: Play a key role in the success of our products by developing models for NLP (and other) tasks; Perform error analysis, data cleaning, and other related tasks to improve models; Collaborate with your team by making recommendations for the development roadmap of a capability; Work with other data scientists and engineers to optimize machine learning models and insert them into end-to-end pipelines; Understand product use-cases and define key performance metrics for models according to business requirements; Set up systems for long-term improvement of models and data quality (e.g. active learning, continuous learning systems, etc.).
- What You Will Bring to the Table: Advanced degree in computer science, data science, statistics, or a related field, or equivalent work experience.
- 4+ years of experience with data science and machine learning in an industry setting.
- 4+ years experience with Python.
- Experience designing and building NLP models for tasks such as classification, named-entity recognition, and dependency parsing.
- Proficiency with standard data analysis toolkits such as SQL, Numpy, Pandas, etc.
- Proficiency with deep learning frameworks like PyTorch (preferred) or TensorFlow.
- Demonstrated ability to drive results in a team environment and contribute to team decision-making in the face of ambiguity.
- Strong time management skills and demonstrable experience of prioritising work to meet tight deadlines.
- Initiative and ability to independently explore and research novel topics and concepts as they arise.
Requirements
- Advanced degree in computer science, data science, statistics, or a related field, or equivalent work experience.
- 4+ years of experience with data science and machine learning in an industry setting.
- 4+ years experience with Python.
- Experience designing and building NLP models for tasks such as classification, named-entity recognition, and dependency parsing.
- Proficiency with standard data analysis toolkits such as SQL, Numpy, Pandas, etc.
- Proficiency with deep learning frameworks like PyTorch (preferred) or TensorFlow.
- Demonstrated ability to drive results in a team environment and contribute to team decision-making in the face of ambiguity.
- Strong time management skills and demonstrable experience of prioritising work to meet tight deadlines.
- Initiative and ability to independently explore and research novel topics and concepts as they arise.