Salary
💰 $125,000 - $145,000 per year
Tech Stack
Cloud, NumPy, Pandas, PySpark, Python, Scala, Scikit-Learn, Spark
About the role
- Build in more automated processes, using the latest AI tools, to improve machine learning model performance and the efficiency of cloud-based data pipelines
- Work closely with Software Engineering and Product to seamlessly integrate data science code into our production software
- Actively engage in R&D to continually scale the impact of BlueConduit’s predictive methods
- Use machine learning pipelines and other internal tools to provide risk predictions for water distribution assets
- Support non-technical clients with data analysis and clear communication of model results on tight timelines
Requirements
- Curiosity to learn and commitment to the human side of data science
- Passion for data science for social good and environmental justice
- Independent problem-solving
- Excellent verbal and written communication skills
- Undergraduate degree in a quantitative field (e.g., CS, math, stats, physics)
- Substantial experience with Python, especially pandas, scikit-learn, PySpark, and NumPy
- Extensive experience (~5+ years) with machine learning and statistical models, including validation and evaluation of model performance
- Machine Learning Operations (MLOps) experience
- Experience building production code based on solid data science work
- Experience building and improving machine learning models and data pipelines
- Experience using the latest AI tools to implement ML/DS work in production software
- Experience deploying, monitoring, and maintaining machine learning models in production environments
- Experience building production-level ML pipelines using PySpark (or Spark in Scala)
- Experience with issues related to modeling (e.g., selection biases, causal inference)
- Experience working with messy data, iterating with clients on a shared dataset
- Ability to build and maintain strong documentation habits
- Proficiency with Git workflow
- Experience building models and pipelines in Databricks
Benefits
- Stock options
- Health benefits (100% coverage of medical premiums for a base plan or a portion of a premium plan; Vision and Dental also available)
- SIMPLE IRA benefit (3% company matching contribution)
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Python, pandas, Scikit-learn, PySpark, numpy, machine learning, statistical models, MLOps, data pipelines, model validation
Soft skills
curiosity, commitment to human side of data science, independent problem-solving, verbal communication, written communication