GSK

Data Curation Developer

full-time

Location Type: Hybrid

Location: Munich • 🇩🇪 Germany

Job Level

Mid-Level, Senior

Tech Stack

Pandas, PySpark, Python

About the role

  • Lead the development of business requirements for data curation through collaboration with R&D business and data platform teams
  • Maintain strong connections with analytical groups and R&D Data Platform teams to ensure seamless data integration and usage
  • Deliver pre-packaged, curated datasets aligned to business requirements for analytics, including documenting data specifications that clearly describe the processing steps required to generate analysis-ready datasets while maintaining provenance, lineage, and privacy requirements
  • Integrate diverse datasets into a unified format for consistent analysis
  • Ensure all datasets meet analysis-ready and privacy requirements by performing necessary data curation activities
  • Provide coaching and peer review to ensure that the team’s work reflects industry best practices for data curation activities
  • Ensure that datasets are processed to meet the conditions set out in the approved data re-use request
  • Write clean, readable code
  • Ensure that deliverables are appropriately quality controlled and documented and, when required, can be handed over to the R&D Tech team for production pipeline implementation

Requirements

  • BSc/MSc/PhD (or equivalent) in Computer Science, Mathematics, Statistics, or related subject
  • Proven experience handling various modalities of scientific clinical data, such as clinical trial data (including biomarkers), real-world data (RWD), omics, etc.
  • Experience with Python, Databricks, Delta Lake, PySpark, Pandas, and other data engineering frameworks, and in applying them to produce datasets compliant with industry standards
  • Proven ability to handle and process large structured, semi-structured, and unstructured datasets efficiently
  • Strong communication skills and expertise to translate business needs into technical data requirements and processes
  • Ability to quantify and provide insights into the business impact and value created by data curation activities
  • Experience with at least one of the industry data standards, such as CDISC (ODM: CDASH, SDTM, ADaM), HL7 FHIR, OMOP (CDM), etc.

Benefits

  • Competitive salary
  • Annual bonus based on company performance
  • Healthcare and wellbeing programmes
  • Pension plan membership
  • Shares and savings programme
  • Hybrid working model
