
Mid-level Data Engineer, Cloudera
CI&T
full-time
Location Type: Remote
Location: Remote • 🇧🇷 Brazil
Job Level
Mid-Level, Senior
Tech Stack
Azure, Cloud, ETL, HDFS, PySpark, Python, Spark, SQL
About the role
- Design, develop and maintain scalable, robust data pipelines.
- Build ingestion, transformation and data modeling solutions using Databricks, Spark/PySpark, Cloudera and Azure Data Factory (ADF).
- Ensure data quality, integrity and usability throughout the entire pipeline.
- Design and maintain Data Lake and Data Warehouse solutions, applying data governance best practices.
- Work with concepts and architectures such as Bronze, Silver and Gold layers, Star Schema, Delta Tables, Delta Sharing and analytical tables.
- Collaborate with technical and business teams to orchestrate solutions aligned with company objectives.
Requirements
- Solid experience as a Data Engineer or in a similar role.
- Advanced knowledge of:
  - SQL
  - Python / PySpark
  - Azure Databricks (workflows, jobs, Delta tables, queries, etc.)
  - Azure Data Factory (ADF)
  - Cloudera (Hive, Impala, HDFS, etc.)
  - ETL/ELT and data pipelines
  - Dimensional data modeling
  - Data Lake / Data Warehouse
  - Data governance and data quality
- Experience with CI/CD practices applied to data environments
- Strong communication and collaboration skills with multidisciplinary teams
- Ability to operate in dynamic environments and handle loosely defined technical scope
- Knowledge of code optimization in cloud environments
- Nice to have:
  - Experience with Teradata
  - Experience with SAS
Benefits
- Health and dental insurance
- Meal and food allowance
- Childcare assistance
- Extended parental leave
- Partnerships with gyms and health & wellness professionals via Wellhub (Gympass) / TotalPass
- Profit Sharing (PLR)
- Life insurance
- Continuous learning platform (CI&T University)
- Discount club
- Free online platform dedicated to promoting physical and mental health and wellbeing
- Pregnancy and parenthood courses
- Partnerships with online course platforms
- Language learning platform
- And many others
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
SQL, Python, PySpark, Azure Databricks, Azure Data Factory, Cloudera, ETL, ELT, Dimensional data modeling, Data governance
Soft skills
Communication, collaboration, ability to operate in dynamic environments, handling loosely defined technical scope