
Data Engineer – MDM Experience
Stefanini Brasil
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇧🇷 Brazil
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
ETLPySparkPythonSQL
About the role
- Data Modeling Architecture: Design and implement the necessary architecture for Patient MDM.
- Business Rules Documentation: Specify and document the business rules that will guide data unification and processing.
- Processing Pipelines: Implement data pipelines in Databricks to ensure efficient processing.
- Matching and Deduplication Logic: Develop matching, deduplication, and golden record generation logic for patient records.
- Data Quality: Establish processes to ensure data quality and validation.
- Table Maintenance: Create and maintain integrated tables according to the defined business rules.
- Update Routines: Implement routines for updating and synchronizing data.
- Code Reviews and Mentoring: Participate in code reviews and provide technical mentorship to the team.
- Stakeholder Collaboration: Work with business stakeholders to refine and validate rules and requirements.
Requirements
- Advanced experience in Python.
- Strong knowledge of Databricks and Delta Lake.
- PySpark for processing large volumes of data.
- SQL and data modeling.
- ETL/ELT and data pipelines.
- Previous experience in Master Data Management (MDM) projects or data integrations.
- Knowledge of matching techniques, fuzzy matching, and record deduplication.
- Ability to implement complex business rules in code.
- Proficiency with Git and agile methodologies.
- Familiarity with development best practices, including unit testing and documentation.
Benefits
- Meal allowance or meal voucher.
- Discounts on courses, universities, and language schools.
- Stefanini Academy — online platform with free, up-to-date courses and certificates.
- Mentoring.
- Benefits club for medical consultations and exams.
- Health insurance.
- Dental insurance.
- Perks and discounts at leading establishments.
- Travel club.
- Pet plan.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
PythonDatabricksDelta LakePySparkSQLETLELTdata modelingmatching techniquesrecord deduplication
Soft skills
code reviewsmentoringstakeholder collaboration