Salary
💰 CA$70,000 - CA$90,000 per year
Tech Stack
Airflow, Apache, Python, Spark, SQL
About the role
- Ensure the quality (accuracy, completeness, timeliness) of Continuous Monitoring customer data
- Run, monitor, and troubleshoot Apache Airflow data pipelines
- Leverage Apache Spark to design and implement scalable data processing pipelines, ensuring efficient handling of large datasets and high data quality
- Implement and maintain data validation frameworks using Great Expectations
- Write SQL queries to analyze data for various quality conditions
- Develop and maintain Python-based data ingestion pipelines and automated test scripts
- Debug issues and conduct exploratory testing to identify their root causes
- Implement test plans and create test scenarios and test cases
- Report defects and issues identified during the test process using a defect tracking system (Jira)
- Attend daily stand-ups to report on testing progress and identify any impediments
- Prioritize and implement multiple activities in parallel
- Engage in peer reviews of requirements, code, design, and tests
- Take ownership of the quality of the product, reproduce customer issues, and perform root cause analysis for test escapes
- Work with developers to capture root causes and supporting artifacts (e.g., logs)
Requirements
- Graduate of a university or college program in mathematics, computer science, programming, or engineering, or equivalent work experience
- 3+ years of experience in data engineering or data quality roles
- Proficiency in diagnosing data quality issues in SQL-like data warehouses
- Proficiency in solving data problems using Python as a primary language or through Python-based frameworks
- Experience with Apache Airflow or a similar Python orchestration tool
- Experience with Great Expectations or similar data validation tools
- Preferred experience in software quality assurance
- Preferred experience with CI/CD pipelines
- Preferred experience working with GitHub
- Strong interpersonal skills and ability to work effectively in a team environment
- Well-organized, result-oriented, self-motivated, collaborative
- Strong ability to prioritize tasks, troubleshoot problems, and resolve issues
Benefits
- Work remotely within a flexible work environment (our team spans the US and Canada)
- Competitive company-paid benefits plan starting day 1!
- Generous professional development budget
- RRSP/401(k) matching program
- Half-day Fridays in the summer
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
SQL, Python, Apache Airflow, Apache Spark, Great Expectations, data validation frameworks, data processing pipelines, data quality, automated test scripts, defect tracking
Soft skills
interpersonal skills, teamwork, organization, self-motivation, collaboration, prioritization, troubleshooting, problem-solving, communication, result-oriented