
Senior Data Engineer, Databricks
CI&T
full-time
Location Type: Remote
Location: Remote • 🇧🇷 Brazil
Job Level
Senior
Tech Stack
AWS, Azure, Cloud, ETL, Google Cloud Platform, PySpark, Python, SQL
About the role
- Develop and maintain data pipelines.
- Integrate data from various sources (APIs, relational databases, files, etc.).
- Collaborate with business and analytics teams to understand data requirements.
- Ensure quality, reliability, security and governance of the ingested data.
- Follow modern DataOps practices such as code versioning, data tests, and CI/CD.
- Document processes and best practices in data engineering.
Requirements
- Proven experience in building and managing large-scale data pipelines in Databricks (PySpark, Delta Lake, SQL).
- Strong programming skills in Python and SQL for data processing and transformation.
- Deep understanding of ETL/ELT frameworks, data warehousing, and distributed data processing.
- Hands-on experience with modern DataOps practices: version control (Git), CI/CD pipelines, automated testing, infrastructure-as-code.
- Familiarity with cloud platforms (AWS, Azure, or GCP) and related data services.
- Strong problem-solving skills with the ability to troubleshoot performance, scalability, and reliability issues.
- Proficiency in Git.
- Advanced spoken English is essential.
Benefits
- Health and dental insurance
- Meal and food allowance
- Childcare assistance
- Extended paternity leave
- Partnership with gyms and health and wellness professionals via Wellhub (Gympass) and TotalPass
- Profit sharing and results participation (PLR)
- Life insurance
- Continuous learning platform (CI&T University)
- Discount club
- Free online platform dedicated to physical, mental, and overall well-being
- Pregnancy and responsible parenting course
- Partnerships with online learning platforms
- Language learning platform
- And many more!
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
data pipelines, Databricks, PySpark, Delta Lake, SQL, ETL, ELT, data warehousing, infrastructure-as-code, automated testing
Soft skills
problem-solving, collaboration, communication