Tech Stack
AzureCloudETLNumpyPandasPySparkPythonSQL
About the role
- Design, develop, and maintain robust ETL/ELT pipelines using Python and modern data engineering tools
- Design, build, and maintain scalable data pipelines and solutions that support business intelligence, analytics, and machine learning initiatives
- Work with structured and unstructured data from various sources including APIs, databases, and cloud platforms
- Optimize data workflows for performance, scalability, and reliability
- Implement data quality checks, monitoring, and alerting mechanisms
- Maintain and enhance data models and schemas for analytics and reporting
- Ensure data security, compliance, and governance standards are met
- Collaborate with data scientists, analysts, and business stakeholders to understand data requirements
Requirements
- 5+ years of experience in data engineering or related roles
- Strong proficiency in Python (Pandas, NumPy, PySpark)
- Strong foundation in data engineering principles
- Experience designing, building, and maintaining ETL/ELT pipelines
- Experience with data pipeline orchestration tools (e.g., Azure Data Factory)
- Proficiency in SQL and relational databases (e.g., SQL Server)
- Familiarity with Azure cloud and cloud-native data services
- Understanding of data pipeline, databases and data warehousing concepts and tools
- Knowledge of version control systems and CI/CD practices
- Experience working with structured and unstructured data, APIs, databases, and cloud platforms
- Preferred: experience with Azure Databricks
- Preferred: knowledge of data governance and metadata management tools
- Preferred: familiarity with machine learning workflows and model deployment