qode.world

Data Architect

qode.world

full-time

Posted on:

Location Type: Hybrid

Location: DallasNew JerseyTexasUnited States

Visit company website

Explore more

AI Apply
Apply

About the role

  • Provide technical solution discovery effort on any new capabilities or new functionality.
  • Assist PO with technical user stories to ensure healthy backlog features
  • Lead the development of real-time data pipelines using AWS DMS, MSK, Kafka or Glue Streaming and for CDC ingestion from multiple SQL Server sources (RDS/on-prem).
  • Build and optimize streaming and batch data pipelines using AWS Glue (PySpark) to validate, transform, and normalize data to Iceberg and DynamoDB.
  • Define and enforce data quality, lineage, and reconciliation logic with support for both streaming and batch use cases.
  • Integrate with S3 Bronze/Silver layers and implement efficient schema evolution and partitioning strategies using Iceberg.
  • Collaborate with architects, analysts, and downstream application teams to design API and file-based egress layers.
  • Implement monitoring, logging, and event-based alerting using CloudWatch, SNS, and EventBridge.
  • Mentor junior developers and enforce best practices for modular, secure, and scalable data pipeline development.

Requirements

  • 6+ years of hands-on expert level data engineering experience in cloud-based environments (AWS preferred) with event driven implementation
  • Strong experience with Apache Kafka / AWS MSK including topic design, partitioning, and Kafka Connect/Debezium
  • Proficiency in AWS Glue (PySpark) and for both batch and streaming ETL
  • Working knowledge of AWS DMS, S3, Lake Formation, DynamoDB, and Iceberg
  • Solid grasp of schema evolution, CDC patterns, and data reconciliation frameworks
  • Experience with infrastructure-as-code (CDK/Terraform) and DevOps practices (CI/CD,Git
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
data engineeringreal-time data pipelinesAWS DMSAWS GluePySparkApache KafkaCDC ingestiondata qualityschema evolutioninfrastructure-as-code
Soft Skills
mentoringcollaborationbest practices enforcement