Lead the development of business requirements for data curation through collaboration with R&D business and data platform teams
Maintain strong connections with analytical groups and R&D Data Platform teams to ensure seamless data integration and usage
Deliver pre-packaged, curated datasets aligned to business requirements for analytics, which includes documenting data specification that clearly describes the required processing steps to generate analysis-ready datasets ensuring providence, lineage and privacy requirements is maintained
Integrate diverse datasets into a unified format for consistent analysis
Ensure all datasets meet analysis-ready and privacy requirements by performing necessary data curation activities
Provide coaching and peer review to ensure that the team’s work reflects industry best practices for data curation activities
Ensure that datasets are processed to meet conditions mentioned in the approved data re-use request
Write clean, readable code
Ensure that deliverables are appropriately quality controlled, documented, and when required, can be handed over to R&D Tech team for production pipeline implementation
Requirements
BSc/MSc/PhD (or equivalent) in Computer Science, Mathematics, Statistics, or related subject
Proven experience of handling various modalities of scientific clinical data such as clinical trial data (including biomarkers), real world data (RWD), omics etc.
Experience in Python, Databricks, Delta Lake, PySpark, Pandas, other data engineering frameworks and applying them to achieve industry standards-compliant datasets
Proven ability to handle and process large structured, semi-structured, and unstructured datasets efficiently
Strong communication skills and expertise to translate business needs into technical data requirements and processes
Ability to quantify and provide insights to business impact and value creation from data curation activities
Experience with at least one of the industry data standards such as CDISC(ODM: CDASH, SDTM, ADaM), HL7 FHIR, OMOP(CDM) etc.
Benefits
competitive salary
annual bonus based on company performance
healthcare and wellbeing programmes
pension plan membership
shares and savings programme
hybrid working model
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.