Senior Data Developer, Azure

CI&T

Engenheiro de Dados na CI&T, com foco em transformação tecnológica e soluções baseadas em IA. Responsável por implementar pipelines e garantir qualidade de dados.

Posted 5/15/2026full-timeRemote • 🇧🇷 BrazilSeniorWebsite

Tech Stack

Tools & technologies

AirflowApacheAzureETLPySparkSQLUnity

About the role

Key responsibilities & impact

Pipeline Development: Implement ELT/ETL pipelines for ingesting, transforming, and delivering data from diverse sources, including spreadsheets, SharePoint, and relational databases, following architecture standards;
Data Platform Construction: Build and organize the layers of the Data Lake/Lakehouse (Bronze, Silver and Gold / Medallion Architecture) using optimized formats such as Delta Lake and Parquet;
Transformation and Modeling: Develop data transformations using PySpark and advanced SQL, implementing architecture-defined models in the platform's analytical layers;
Ingestion Strategies: Implement loading strategies (full load or incremental) appropriate to the volume and criticality of each data domain — focusing on efficiency for small-data contexts;
Data Quality: Implement automated data quality tests and checks within pipelines, ensuring consistency, integrity, and traceability across layers;
Cataloging and Governance: Support the cataloging and documentation of data assets with emphasis on lineage and classification, following governance guidelines defined by the Architect, establishing a unified data catalog as a central element of platform management;
Observability: Ensure observability of pipelines through logs, alerts, and proactive monitoring;
Documentation: Document models, transformations, and business rules applied in pipelines, ensuring traceability and maintainability of the solution.

Requirements

What you’ll need

Proven experience in Data Engineering in cloud environments;
Strong experience with PySpark for developing pipelines and data transformations on distributed platforms;
Experience with advanced SQL for queries, transformations, and analytical modeling;
Knowledge of Microsoft Azure (Azure Data Factory, ADLS Gen2 and/or Microsoft Fabric);
Experience with data formats such as Delta Lake, Parquet, or ORC;
Experience with Data Lake, Lakehouse, and Medallion Architecture;
Experience with data cataloging and lineage, including Unity Catalog (Databricks);
Preferred / Nice to Have:
Experience with Databricks and/or Snowflake as a data processing platform;
Familiarity with Microsoft Fabric (Lakehouses, Notebooks, Pipelines or Dataflows);
Experience with Microsoft Purview for data cataloging and lineage;
Knowledge of dbt for transformations and data quality testing;
Experience orchestrating pipelines with Apache Airflow or Prefect;
DP-700 certification or equivalent;
Development best practices: version control with Git, automated testing, documentation, and CI/CD;
Familiarity with Gen AI and autonomous agents applied to data.

Benefits

Comp & perks

Health and dental insurance;
Meal and food allowance;
Childcare assistance;
Extended parental leave;
Partnerships with gyms and health and wellness professionals via Wellhub (Gympass) / TotalPass;
Profit Sharing (PLR);
Life insurance;
Continuous learning platform (CI&T University);
Discount club;
Free online platform dedicated to promoting physical and mental health and wellbeing;
Expectant parent and responsible parenthood course;
Partnerships with online course platforms;
Language learning platform;
And many others

ATS Keywords

✓ Tailor your resume

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

Data EngineeringPySparkSQLDelta LakeParquetData LakeLakehouseMedallion Architecturedata catalogingdata quality testing

Soft Skills

documentationobservabilitydata governance

Certifications

DP-700