Data Engineer

• Own our SQL data estate, design scalable pipelines, and lead data enrichment across our Azure-first platform.
• Set the standards for modelling, quality, security, and cost while writing production-grade Python and SQL daily.
• Design and evolve schemas for OLTP/OLAP (Azure SQL, Synapse, Delta Lake), with partitioning, indexing, and RLS for multi-tenant isolation.
• Establish data contracts and versioning, govern schema evolution, and implement CDC +SCD patterns.
• Performance engineering: query tuning, resource classes, caching strategies, and cost guardrails.
• Architect ELT/ETL across batch & streaming using Azure Data Factory/Synapse/Databricks, Event Hubs/Service Bus, Functions, and Container Apps/AKS.
• Build reliable, observable pipelines (idempotent, retryable, lineage-aware) with SLAs/SLOs and runbooks.
• Implement CI/CD for data (dbt/SQL projects, PySpark jobs, tests) using GitHub Actions and IaC (Terraform/Bicep).
• Define and operate enrichment layers: UPC/GS1, OCR/EXIF metadata, taxonomies, embeddings, and third-party data joins.
• Curate gold/semantic models for analytics & product APIs; manage feature/metric definitions and documentation.
• Partner with DS/ML to operationalize feature stores, model outputs, drift signals, and evaluation tables.
• Own reference architecture across ADLS Gen2, Synapse/Databricks, Azure SQL/SQL Server, Cosmos DB (incl. vector), Azure AI Search, Key Vault, Purview.
• Security & compliance by default: encryption, secret management, RBAC/ABAC, data retention and GDPR/SOC 2 controls.
• Observability: OpenTelemetry + Azure Monitor/App Insights, data quality tests, freshness SLAs, and lineage in Purview.

Data Architect

GCP Data Architect

Data Migration and Reporting Specialist

Senior Data Engineer

Data Warehouse Developer – Level II