Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
C the Signs

AI Data Engineer

C the Signs

Data Engineer developing scalable data pipelines for LLMs and ML models in healthcare. Collaborating with teams to ensure data quality, compliance, and advanced data engineering practices.

Posted 4/28/2026full-timeRemote • Massachusetts, New Hampshire, New Jersey, New York, Rhode Island • 🇺🇸 United StatesMid-LevelSeniorWebsite

Tech Stack

Tools & technologies
AirflowApacheAWSAzureCloudETLGoogle Cloud PlatformJavaPythonScalaSpark

About the role

Key responsibilities & impact
  • Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
  • Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
  • Implement robust data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.
  • Implement robust data cleaning, validation, and transformation processes to ensure data quality and integrity.
  • Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
  • Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
  • Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
  • Document data engineering processes, data models, and data dictionaries.
  • Stay up-to-date with the latest advancements in data engineering, big data technologies, and machine learning.

Requirements

What you’ll need
  • Required
  • - Bachelor's degree in Computer Science, Engineering, or a related field.
  • - Proven experience as a Data Engineer, with a focus on big data technologies.
  • - Strong proficiency in programming languages such as Python, Scala, or Java.
  • - Extensive experience with data warehousing, ETL processes, and data modeling.
  • - Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
  • - Hands-on experience with big data frameworks like Apache Spark for distributed processing.
  • - Excellent problem-solving skills and the ability to work independently and as part of a team.
  • - Strong communication and interpersonal skills.
  • Preferred
  • - Master's degree in a related field.
  • - Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
  • - Familiarity with machine learning concepts and LLM fine-tuning processes.
  • - Experience with data orchestration tools (e.g., Apache Airflow).
  • Work Authorization:
  • - Must be a US Citizen, Green Card holder, or currently in the US have valid H1B visa

Benefits

Comp & perks
  • **Why Join Us?**
  • Joining **C the Signs** is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.
  • **Benefits:**
  • - Competitive salary and benefits package.
  • - Flexible working arrangements (remote or hybrid options available).
  • - The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • - Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • - Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
data engineeringbig data technologiesPythonScalaJavadata warehousingETL processesdata modelingApache Sparkdata validation
Soft Skills
problem-solvingindependent workteam collaborationcommunicationinterpersonal skills
Certifications
Bachelor's degree in Computer ScienceMaster's degree in a related field