Lean Tech

Senior Data Scientist

Lean Tech

full-time

Posted on:

Origin:  • 🇺🇸 United States

Visit company website
AI Apply
Manual Apply

Job Level

Senior

Tech Stack

AWSAzureCloudDockerETLGoogle Cloud PlatformKubernetesNoSQLPythonSQLTableau

About the role

  • Design, build, and optimize advanced machine learning models for demand forecasting, customer transcript analysis, marketing ROI optimization, and product return prediction
  • Develop and maintain data processing and ETL pipelines using Python and SQL for data manipulation, automation, and integration with ML workflows
  • Leverage Microsoft data and AI technologies (Azure Fabric, Azure Machine Learning, AutoML, Azure Data Lake, SQL Server, NoSQL, Copilot Studio)
  • Use Azure Data Factory to orchestrate data pipelines and automated model training processes
  • Partner with data engineers to prepare and structure data following medallion architecture (bronze, silver, gold layers)
  • Perform data preprocessing, feature engineering, and statistical analysis to ensure high-quality model inputs
  • Evaluate, test, and refine models for accuracy, scalability, and business impact; implement automated testing, CI/CD, and model monitoring for production readiness
  • Integrate and visualize model outputs and insights using Power BI, Tableau, and Looker; apply data warehousing concepts for reporting
  • Write advanced analytic queries and optimize SQL performance on large-scale data platforms
  • Collaborate with cross-functional teams (engineering, BI analysts, business leaders) to translate data insights into business strategies
  • Adhere to software engineering best practices including Infrastructure as Code, modularity, code quality, and maintainability
  • Stay current with advancements in AI, machine learning, and the Microsoft ecosystem to improve solutions

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Statistics, Mathematics, or a related field
  • 5+ years of experience in data science or applied machine learning
  • Advanced proficiency in Python for ML pipelines, data processing, ETL, and MLOps
  • Advanced proficiency in SQL, including advanced analytic queries and performance optimization on large-scale platforms
  • Proven ability to develop predictive models and deliver business impact
  • Experience with Microsoft data stack: Azure Fabric, Azure Machine Learning, AutoML, SQL Server, NoSQL, Azure Data Lake
  • Experience designing and managing Azure-based ML pipelines and leveraging AutoML
  • Experience with data pipeline orchestration using Azure Data Factory
  • Familiarity with medallion architecture (bronze/silver/gold layers) and modern data engineering practices
  • Experience collaborating with data engineers to structure data using medallion architecture
  • Intermediate knowledge of data warehousing concepts
  • Experience with BI tools: Power BI, Tableau, Looker
  • Strong statistical analysis and feature engineering skills
  • Knowledge of software engineering best practices for ML production: Infrastructure as Code, separation of environments, modularity, automated testing, CI/CD pipelines, model monitoring
  • Excellent communication skills
  • Nice to have: experience with AWS or Google Cloud Platform, Python visualization libraries (Matplotlib, Seaborn, Plotly), data governance or data security experience, Azure/data science/machine learning certifications, agile methodology experience, industry background in entertainment/finance/logistics, containerization with Docker/Kubernetes, mentoring experience