Tech Stack
AWSCloudGoogle Cloud PlatformNumpyPandasPythonPyTorchScikit-LearnTensorflow
About the role
- Help build automation platforms and infrastructure that power drug discovery projects.
- Work with the Data Science team to design, build, optimize, and maintain data processing pipelines.
- Build and maintain a database of biological images, assay readouts, and other data to enable rapid data retrieval and visualization.
- Develop and maintain core APIs for data analysis and visualization.
- Create tools, dashboards, and metrics to help lab operations keep track of the quality and timeliness of the data they generate.
- Collaborate across teams and communicate with technical and non-technical stakeholders.
Requirements
- 3+ years software engineering industry experience.
- High fluency with the Python data stack (numpy, pandas, sklearn, etc).
- Demonstrated ability to write high-quality, production-ready code.
- Experience automating deployments, logging, collecting metrics, and monitoring jobs.
- Database experience, including schema design and population.
- Strong desire to work collaboratively in and across teams.
- Ability to communicate with technical and non-technical stakeholders.
- Eligible to work in the United States.
- Experience with cloud computing services (AWS or GCP).
- Experience with PyTorch and Tensorflow.
- Scalable machine learning experience, including application to large datasets (100TB+).
- Experience with biological data (sequence, proteomics, images, etc).