Engineer II, Data

CarMax

Data Engineer II at CarMax responsible for building and maintaining production-grade data pipelines. Collaborating with analytics teams and ensuring data quality and governance in a cloud environment.

Posted 4/14/2026full-timeDallas • Texas • 🇺🇸 United StatesJuniorMid-LevelWebsite

Tech Stack

Tools & technologies

ApacheAWSAzureCloudGoogle Cloud PlatformKafkaPythonSpark

About the role

Key responsibilities & impact

Design, build, and maintain production-grade data pipelines (streaming and batch)
Implement event-driven ingestion and near real‑time processing
Build transformations using distributed data processing frameworks
Write and maintain clean, testable Python code following software engineering best practices
Support CI/CD pipelines, automated deployments, and environment promotions
Monitor and troubleshoot pipeline failures, latency issues, and data quality problems
Collaborate closely with Data Scientists, Analysts, and platform engineers

Requirements

What you’ll need

2+ years of experience as a Data Engineer or Software Engineer working with data
Strong experience writing production Python code
Experience building and supporting pipelines on Azure or an equivalent cloud platform (AWS/GCP)
Hands‑on experience with distributed data processing, such as: Apache Spark, Databricks (or Spark runners such as EMR, Dataproc)
Experience working with event streaming or messaging platforms, such as: Azure Event Hubs, Apache Kafka / Confluent Kafka, Amazon Kinesis
Practical experience with CI/CD pipelines, version control, and automated deployments
Familiarity with data modeling, schema management, and data reliability concepts
Experience operating systems in an agile, collaborative engineering team

Benefits

Comp & perks

Health insurance
401(k) matching
Flexible work hours
Paid time off
Professional development opportunities

ATS Keywords

✓ Tailor your resume

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

Pythondata pipelinesevent-driven ingestionreal-time processingdistributed data processingApache SparkDatabricksCI/CDdata modelingschema management

Soft Skills

collaborationtroubleshootingproblem-solvingcommunicationagile methodology