Tech Stack
AWSAzureCloudDockerETLGoogle Cloud PlatformKubernetesNoSQLPythonSQLTableau
About the role
- Design, build, and optimize advanced machine learning models for demand forecasting, customer transcript analysis, marketing ROI optimization, and product return prediction
- Develop and maintain data processing and ETL pipelines using Python and SQL for data manipulation, automation, and integration with ML workflows
- Leverage Microsoft data and AI technologies (Azure Fabric, Azure Machine Learning, AutoML, Azure Data Lake, SQL Server, NoSQL, Copilot Studio)
- Use Azure Data Factory to orchestrate data pipelines and automated model training processes
- Partner with data engineers to prepare and structure data following medallion architecture (bronze, silver, gold layers)
- Perform data preprocessing, feature engineering, and statistical analysis to ensure high-quality model inputs
- Evaluate, test, and refine models for accuracy, scalability, and business impact; implement automated testing, CI/CD, and model monitoring for production readiness
- Integrate and visualize model outputs and insights using Power BI, Tableau, and Looker; apply data warehousing concepts for reporting
- Write advanced analytic queries and optimize SQL performance on large-scale data platforms
- Collaborate with cross-functional teams (engineering, BI analysts, business leaders) to translate data insights into business strategies
- Adhere to software engineering best practices including Infrastructure as Code, modularity, code quality, and maintainability
- Stay current with advancements in AI, machine learning, and the Microsoft ecosystem to improve solutions
Requirements
- Bachelor’s or Master’s degree in Computer Science, Statistics, Mathematics, or a related field
- 5+ years of experience in data science or applied machine learning
- Advanced proficiency in Python for ML pipelines, data processing, ETL, and MLOps
- Advanced proficiency in SQL, including advanced analytic queries and performance optimization on large-scale platforms
- Proven ability to develop predictive models and deliver business impact
- Experience with Microsoft data stack: Azure Fabric, Azure Machine Learning, AutoML, SQL Server, NoSQL, Azure Data Lake
- Experience designing and managing Azure-based ML pipelines and leveraging AutoML
- Experience with data pipeline orchestration using Azure Data Factory
- Familiarity with medallion architecture (bronze/silver/gold layers) and modern data engineering practices
- Experience collaborating with data engineers to structure data using medallion architecture
- Intermediate knowledge of data warehousing concepts
- Experience with BI tools: Power BI, Tableau, Looker
- Strong statistical analysis and feature engineering skills
- Knowledge of software engineering best practices for ML production: Infrastructure as Code, separation of environments, modularity, automated testing, CI/CD pipelines, model monitoring
- Excellent communication skills
- Nice to have: experience with AWS or Google Cloud Platform, Python visualization libraries (Matplotlib, Seaborn, Plotly), data governance or data security experience, Azure/data science/machine learning certifications, agile methodology experience, industry background in entertainment/finance/logistics, containerization with Docker/Kubernetes, mentoring experience