BJAK

Data Engineer

full-time

Location: 🇺🇸 United States

Job Level

Mid-Level, Senior

Tech Stack

Python, SQL

About the role

  • Collect, clean, and preprocess user-generated text and image data for fine-tuning large models
  • Design and manage scalable data labeling pipelines, leveraging both crowdsourcing and in-house labeling teams
  • Build and maintain automated datasets for content moderation (e.g., safe vs unsafe content)
  • Collaborate with researchers and engineers to ensure datasets are high-quality, diverse, and aligned with model training needs

Requirements

  • Proven experience preparing datasets for machine learning or fine-tuning large models
  • Strong skills in data cleaning, preprocessing, and transformation for both text and image data
  • Hands-on experience with data labeling workflows and quality assurance for labeled data
  • Familiarity with building and maintaining moderation datasets (safety, compliance, and filtering)
  • Proficiency in scripting (Python, SQL) and working with large-scale data pipelines
TASC

Principal Data Engineer

Lead · full-time · $138k–$265k / year · Montana, New York · 🇺🇸 United States
Posted: 37 minutes ago · Source: mastercard.wd1.myworkdayjobs.com
AWS, Azure, Cloud, Distributed Systems, ETL, Hadoop, Jenkins, PySpark, Python, SQL
Woolpert

Cloud Data Engineer

Junior · full-time · $60k–$65k / year · 🇺🇸 United States
Posted: 2 hours ago · Source: boards.greenhouse.io
Azure, Cloud, JavaScript, Python, SQL, Tableau
General Dynamics Information Technology

Cloud/Data Engineer – TS/SCI with Polygraph

Senior · Lead · full-time · $161k–$207k / year · Virginia · 🇺🇸 United States
Posted: 3 hours ago · Source: gdit.wd5.myworkdayjobs.com
Apache, AWS, Cloud, Spark
Blizzard Entertainment

Data Engineer

Mid · Senior · full-time · $78k–$143k / year · California · 🇺🇸 United States
Posted: 4 hours ago · Source: activision.wd1.myworkdayjobs.com
Airflow, AWS, Azure, BigQuery, Cloud, ETL, Google Cloud Platform, Oracle, Python
Allata

Data Architect – Databricks

Mid · Senior · full-time · Texas · 🇺🇸 United States
Posted: 8 hours ago · Source: jobs.lever.co
Airflow, AWS, Azure, Cloud, ETL, Grafana, Jenkins, MS SQL Server, Oracle, Postgres, Prometheus, PySpark, +4 more