- Develop and apply patent data-gathering and mining approaches to build high quality foundation data sets.
- Process and analyze small- and large-chemical and biological intellectual property datasets, including therapeutic use, molecular target and chemical structure data.
- Integrate chemical and biological databases to patent data.
- Develop novel data mining approaches to unearth cryptic data that enables decision making in partner organizations.
- Optimize data pipelines for processing and storing patent data, using text-mining, cheminformatics and bioinformatics approaches.
- Collaborate with cross-functional teams to integrate computational approaches into curation and analysis workflows.
- Contribute to thought leader articles on drug intellectual property informatics and data mining.
- Maintain best practices in data integrity, reproducibility, and documentation of data sources and derived content.
- Other responsibilities as required to support product development and maintenance.
Requirements
**Required Qualifications:**
- Ph.D. or Master’s degree in Cheminformatics, Computational Chemistry, Bioinformatics, Data Science, or a related field.
- 5-10+ years of experience in cheminformatics, computational drug discovery, or machine learning applications in chemistry.
- Proficiency in Python/R, with experience in cheminformatics libraries and topics.
- Strong knowledge of molecular descriptors, drug targets, and chemical/biological informatics techniques.
- An innate sense of how to query and derive value from patent data.
- Familiarity with Open Source and academic/commercial competitive intelligence/patent systems.
- Experience working in a structured collaborative data and software development environment (git, SQL/Postgres, python notebooks).
- Exceptional communication skills in written and verbal communication of science, a natural story-teller to make sense and provide insights from complex data.
**Preferred Qualifications:**
- Understanding of regulatory and patent landscapes for chemical and pharmaceutical data.
- Text mining experience, NER/NLP. Existing expertise in Python and relational database systems. API development and systems architecture.
- We understand that we are looking for a broad range of skills, so are committed to on the job coaching from experienced team members.
Benefits
**What We Offer:**
- Competitive salary and stock options.
- Active mentoring in data science/drug discovery within a highly experienced team
- Professional development opportunities and conference sponsorship.
- A collaborative environment working on cutting-edge computational drug discovery and data science.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
cheminformaticscomputational chemistrybioinformaticsdata miningmachine learningPythonRmolecular descriptorstext miningAPI development
Soft skills
communication skillscollaborationstorytellingdata integrityreproducibilitydocumentation