RefinedScience

Data Engineering Intern

RefinedScience

internship

Posted on:

Location Type: Remote

Location: United States

Visit company website

Explore more

AI Apply
Apply

Job Level

Tech Stack

About the role

  • Assist in building and maintaining data pipelines for ingesting, transforming, and validating clinical, biological, and real-world data
  • Support integration of data from multiple sources (e.g., clinical data, analytics outputs, external datasets)
  • Help develop and optimize ETL/ELT workflows to ensure data quality and reliability
  • Collaborate with data science and bioinformatics teams to support analytics and machine learning workflows
  • Contribute to data modeling, documentation, and best practices for data infrastructure
  • Participate in code reviews, testing, and performance improvements
  • Participate in Quality Reviews and Troubleshooting
  • Communicate progress and findings to cross-functional teams

Requirements

  • Currently enrolled in a Bachelor’s, Master’s, or Ph.D. program in Data Engineering, Computer Science, Data Science, Software Engineering, or a related field
  • Experience with Python and/or SQL through coursework, projects, or internships
  • Basic understanding of data pipelines, databases, and data transformation concepts
  • Familiarity with version control (e.g., Git)
  • Strong analytical thinking and problem-solving skills
  • Ability to learn quickly and work collaboratively in a team environment
Benefits
  • Flexible work arrangements
  • Professional development
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonSQLETLELTdata pipelinesdata transformationdata modelingdata validationanalyticsmachine learning
Soft Skills
analytical thinkingproblem-solvingcollaborationcommunicationadaptability