
Data Scientist I – Cancer Center
Cleveland Clinic
full-time
Location Type: Remote
Location: Ohio • United States
About the role
- Apply statistical and machine learning techniques to solve moderately complex analytical problems involving large, unstructured datasets
- Support advanced analytics initiatives with a strong focus on large language models (LLMs)
- Contribute to the development of AI agents, LLM pipelines, and reusable data abstractions
- Partner with senior and lead data scientists to deliver high-impact analytical solutions
- Collaborate closely with other data scientists and software engineers
- Translate analytical concepts into scalable, production-ready systems
- Participate in model building and development under direction of other Data Scientists
- Utilize methods in modeling, AI, Machine Learning (ML), Deep Learning (DL), and Natural Language Processing (NLP)
Requirements
- Bachelor’s Degree in Statistics, Actuarial Science, Econometrics, Physics, Biostatistics, Computer Science, Applied Mathematics, Engineering, Business Analytics, Economics, Finance, or a related field
- Excellent written, verbal, and presentation skills in English and ability to explain the value of Machine Learning (ML) and Artificial Intelligence (AI) to business leaders
- 18 months of related experience working with relational databases and/or distributed computing platforms and their query interfaces, such as SQL, Teradata, MapReduce, Pig, and Hive; a Master’s Degree can substitute for this experience
- Experience working with a variety of statistical languages/packages, e.g., SAS, R, Python, Spark, and/or SPSS
- Required: knowledge of applying advanced statistics to complex business problems (e.g., modeling, AI, ML, Deep Learning [DL], and/or Natural Language Processing [NLP])
- Familiarity with additional programming languages, including Python, Java, or C/C++
- Experience leveraging visualization software and techniques and business intelligence (BI) software
- Technical knowledge of distributed computing platforms and of common data process flows, from data instrumentation and generation, through ETL, to the data warehouse
- Demonstrated leadership qualities, including presentation, influencing, and negotiation skills
Benefits
- Health insurance
- 401(k) matching
- Paid time off
- Professional development opportunities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
statistical techniques, machine learning, deep learning, natural language processing, modeling, data abstraction, relational databases, SQL, Python, R
Soft Skills
written communication, verbal communication, presentation skills, influencing, negotiation, collaboration, leadership