
Data Engineer
Iluma Alliance
full-time
Posted on:
Location Type: Remote
Location: Colombia
Visit company websiteExplore more
About the role
- Own scientific data semantics: metadata logic, IDs, experiment keys, joins, and consistency across projects
- Translate scientific outputs into well-defined analytical datasets that preserve biological meaning
- Ensure datasets are analysis-ready, safe, and comparable (baseline vs intervention, customer vs benchmarks)
- Enable technical sales & claims: shape datasets and queries that accelerate recommendations and mechanistic claims
- Support R&D decisions: strengthen data practices that guide strain/formulation prioritization and experimental design
- Design privacy-preserving exposure: define safe keys, pseudonymization rules, and controlled mappings
- Lead cross-team delivery: bridge Biology/R&D and Data/Software (priorities, timelines, validation)
- Promote AI-native workflows: use AI tools to accelerate cleaning, QA, documentation, and repeatable analysis templates
Requirements
- Strong Python and SQL (joins, validation, data quality)
- Experience designing data schemas / relational models and analytical datasets
- Comfort with messy, non-standard scientific data and experimental design logic
- Systems thinking, ownership mindset, strong cross-functional communication
- Familiarity with cloud storage concepts (AWS/S3 or equivalent)
- Active use of AI tools to speed up engineering and analysis
Benefits
- Permanent contract (open-ended)
- Remote role across LATAM
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
PythonSQLdata schemasrelational modelsanalytical datasetsdata qualityexperimental designdata semanticsmetadata logicpseudonymization
Soft skills
systems thinkingownership mindsetcross-functional communicationleadershipcollaborationproblem-solvingdata practicesanalytical thinkingproject managementstakeholder engagement