Salary
💰 CA$94,220 - CA$174,980 per year
Tech Stack
AzureCloudHDFSMapReduceNoSQLPySparkScala
About the role
- Design, build, and maintain reliable, efficient, and scalable data infrastructure for collection, storage, transformation, and analysis
- Implement data orchestration pipelines, data sourcing, cleansing, augmentation, and quality control processes
- Collaborate with business and technology partners to understand data infrastructure requirements
- Design, build, and maintain scalable data solutions including data pipelines, data models, and applications for ML workflows
- Design, implement, and maintain data platforms (data warehouses, data lakes, lakehouses) for structured and unstructured data
- Work with Data Scientists and Engineers to create features and pre-process data and move models into production
- Develop analytical tools, algorithms, and modernization roadmaps; write scripts and automate tasks
- Monitor data integrity and apply appropriate data interchange formats
- Deploy machine learning models into production, integrating with existing systems, data sources, and APIs
- Promote data querying APIs, data versioning, and lineage tracking for transparency and traceability
Requirements
- Knowledge of database systems, data lakes, and NoSQL databases
- Knowledge of data warehouse concepts and architectures (e.g., Synapse & Datbricks)
- Familiarity with data quality and data modelling tools
- Proficiency using version control systems like Git
- Experience with cloud-native data services such as PySpark, Scala, Azure Data Factory and Databricks
- Proficiency in data processing frameworks and techniques such as HDFS, MapReduce, storage formats (Avro, Parquet), and stream processing
- Experience integrating with back-end/legacy environments
- Knowledge of AI model deployment in production environments
- Experience handling real-time data for AI applications
- Ability to build and deploy DataOps and MLOps pipelines in cloud-native environments
- Preferred: 5-7 years experience in Data Engineering or Software Development
- Preferred: Bachelor's or Master's in Engineering or Data Science