Tech Stack
Azure, Cyber Security, ETL, PySpark, Python, SQL, Terraform, Vault
About the role
- Design and build modern Azure data platforms (ingestion, orchestration, storage zones, serving patterns)
- Define security/networking boundaries, cost/perf tradeoffs, and promotion strategy (Dev→Test→Prod)
- Implement robust ELT/ETL with ADF/Synapse Pipelines (parameters, reusable templates, CI/CD)
- Write transformations, utilities, and tests hands-on in T-SQL and Python/PySpark
- Own physical and semantic modeling, partitioning, columnstore strategies, statistics management, query plan analysis, and index design
- Implement observability and reliability: SLA/SLO definitions, Azure Monitor/Log Analytics/App Insights dashboards and alerts; error handling, retries/backoff, idempotency, CDC and schema drift strategies
- Enforce security and governance: RBAC, Key Vault, managed identities, private endpoints/VNet, data masking; document data contracts and access patterns
- Provide leadership through code reviews, PR discipline, mentoring, and documentation/runbooks for client handoff
- Deliver immediate impact on production-grade platforms for clients in Energy, Life Sciences, and Food & Beverage
Requirements
- 8–12+ years in data engineering (recent Azure focus)
- Expert with ADF: linked services, datasets, and integration runtimes (including self-hosted)
- Synapse (SQL pools/serverless, pipelines)
- ADLS/Blob storage experience
- T-SQL: advanced query tuning, execution plan analysis, windowing, TVFs/stored procs, temp tables vs CTE tradeoffs, cardinality estimator know-how
- Python/PySpark: production data transforms, packaging, and testing
- CI/CD: Azure DevOps or GitHub Actions (multi-stage releases, approvals, infra + data deployments)
- Proven delivery of production-grade platforms at scale (TB-level data, strict SLAs)
- Experience with ELT/ETL tooling and pipeline design (ADF/Synapse Pipelines)
- Database performance: partitioning, columnstore, statistics management, query plan analysis, index design, concurrency & transaction isolation, workload management
- Observability: Azure Monitor / Log Analytics / App Insights, SLA/SLO definitions, dashboards and alerts
- Security: RBAC, Key Vault, managed identities, private endpoints/VNet, data masking patterns
- Contract-only: 1099/C2C; no C2H
- Must operate under your own LLC or work through a staffing agency
- Required for contractors: General Liability Insurance and Cybersecurity Insurance
- Location requirement: Remote (U.S.), with core working hours overlapping Central Time (CST)
- Nice to have: data validation procedures, experience with large SQL tables (~100M rows), IaC (Bicep/Terraform), event-driven integration (Service Bus/Event Grid, CDC tooling), DP-203 or AZ-204 certs