Tech Stack
ApacheAWSEC2JavaScriptNode.jsPythonScalaSDLCSparkTypeScript
About the role
- Design, develop and support a variety of internally developed applications, tools and platforms focused on research data management.
- Contribute to ongoing data management processes with automations fueled by AI/ML, utilizing GenAI, Agentic AI and emerging LLM models.
- Develop and support data pipelines, implement necessary integration work, and manage electronic data collection and access in accordance with BMS policies and project expectations.
- Development and testing of data pipelines, metadata management, lifecycle management, data quality assurance, data management (parsing, transformations, extraction), data enrichment through scripting, and contributing to solution design.
- Develop documentation, APIs, data models, transformation rules to facilitate knowledge of scientific data repositories.
- Collaborate with cross-functional teams to design, develop, and maintain software solutions.
- Explore and integrate AI and LLM technologies for productivity benefits.
- Develop frameworks and automations for managing complex data pipelines, monitor data quality issues and manage data access.
- Provide technical expertise in the capture and transformation of complex scientific data.
- Understand the needs of the business and commit to delivering the best user experience and adoption practices.
Requirements
- BS/MS in STEM (Computer Science, Engineering) or a degree in life sciences with computer science expertise.
- 2+ years of relevant industry experience, or a strong technical graduate.
- Experience/expertise in database development, data transformations, systems and data integration, building pipelines and data lake management.
- Experience leveraging AI tools and LLM models for productivity improvement in software development, data management and automation.
- Proficient with programming languages like Python and JavaScript.
- Hands-on experience with Scala and Node.js is an asset.
- Experience with AWS Web Services for storage management, application deployment, access control, messaging, serverless architecture, API development, database development etc.
- Demonstrated enthusiasm and curiosity for AI, LLMs, and emerging technologies.
- Strong aptitude for software development and problem-solving.