Curotec

Senior Data Engineer

full-time

Origin: 🇺🇸 United States • New York

Job Level

Senior

Tech Stack

Azure, Cloud, PySpark, Python

About the role

  • Support the ingestion, processing, and synchronization of data across the analytics platform
  • Build and maintain Python notebooks to ingest data from third-party APIs
  • Design and implement Medallion layer architecture (Bronze, Silver, Gold) for structured data organization and progressive data refinement
  • Store and manage data within Microsoft Fabric's Data Lake and Data Warehouse using the Delta Parquet file format
  • Set up data pipelines and sync key datasets to Azure Synapse Analytics
  • Develop PySpark-based data transformation processes across Bronze, Silver, and Gold layers
  • Collaborate with developers, analysts, and stakeholders to ensure data availability and accuracy
  • Monitor, test, and optimize data flows for reliability and performance
  • Document processes and contribute to best practices for data ingestion and transformation

Requirements

  • Strong experience with Python for data ingestion and transformation
  • Proficiency with PySpark for large-scale data processing
  • Proficiency in working with RESTful APIs and handling large datasets
  • Experience with Microsoft Fabric or similar modern data platforms
  • Understanding of Medallion architecture (Bronze, Silver, Gold layers) and data lakehouse concepts
  • Experience working with Delta Lake and Parquet file formats
  • Understanding of data warehousing concepts and performance tuning
  • Familiarity with cloud-based workflows, especially within the Azure ecosystem
  • Hands-on experience working with API-based data ingestion and modern data architectures
  • Nice to have: Experience with marketing APIs such as Google Ads or Google Analytics 4
  • Nice to have: Familiarity with Azure Synapse and Data Factory pipeline design
  • Nice to have: Understanding of data modeling for analytics and reporting use cases
  • Nice to have: Experience with AI coding tools
  • Nice to have: Experience with Fivetran, Airbyte, and Rivery