BJAK

数据工程师

BJAK

full-time

Posted on:

Location Type: Hybrid

Location: 🇨🇳 China

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

PythonSQL

About the role

  • 将语言模型转化为现实应用,参与构建面向全球用户的 AI 系统
  • 收集、清洗与预处理用户生成的文本与图像数据,用于大模型微调
  • 设计并管理可扩展的数据标注流程,结合众包平台与内部标注团队
  • 构建与维护用于内容审核的自动化数据集(例如:安全与不安全内容的识别)
  • 与研究员及工程师协作,确保数据集具有高质量、多样性,并对齐模型训练目标
  • Fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production
  • Collaborate with regional teams across product, engineering, operations, infrastructure and data
  • Ensure datasets support model safety, trustworthiness, and scalability

Requirements

  • 有为机器学习或大模型微调准备数据集的成熟经验
  • 精通文本与图像数据的清洗、预处理与转换流程
  • 熟悉数据标注工作流,并能对标注数据进行质量控制与管理
  • 熟悉内容审核类数据集的构建与维护(例如安全性、合规性、过滤机制)
  • 精通脚本编程(如 Python、SQL),并能处理大规模数据管道
  • Proven experience preparing datasets for machine learning or fine-tuning large models
  • Strong skills in data cleaning, preprocessing, and transformation for both text and image data
  • Hands-on experience with data labeling workflows and quality assurance for labeled data
  • Familiarity with building and maintaining moderation datasets (safety, compliance, and filtering)
  • Proficiency in scripting (Python, SQL) and working with large-scale data pipelines
Benefits
  • 扁平化组织架构与真实项目主导权
  • 全程参与产品方向制定与共识决策过程
  • 灵活办公制度
  • 高影响力岗位,跨产品、数据与工程多团队协作
  • 顶尖市场薪酬与绩效奖金制度
  • 全球化产品开发机会
  • 丰厚福利:住房补贴、高质量公司食堂、加班餐补
  • 健康、牙科与视力保险
  • 全球差旅保险(适用于你与家属)
  • 无限制、弹性带薪休假
  • Flat structure & real ownership
  • Full involvement in direction and consensus decision making
  • Flexibility in work arrangement
  • High-impact role with visibility across product, data, and engineering
  • Top-of-market compensation and performance-based bonuses
  • Global exposure to product development
  • Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
  • Health, dental & vision insurance
  • Global travel insurance (for you & your dependents)
  • Unlimited, flexible time off

ATS Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
machine learningdataset preparationdata cleaningdata preprocessingdata transformationdata labeling workflowsquality assurancescriptingPythonSQL
Soft skills
collaborationproject managementcommunication
1KOMMA5°

Working Student – Analytics Platform Engineer

1KOMMA5°
Entrypart-time🇩🇪 Germany
Posted: 24 days agoSource: 1komma5grad.jobs.personio.com
PythonSQL
Dropbox

Staff Data Scientist, AI Products

Dropbox
Leadfull-time$175k–$267k / year🇺🇸 United States
Posted: 3 days agoSource: boards.greenhouse.io
PythonSQL
Walmart

Senior Software Engineer

Walmart
Seniorfull-time$117k–$234k / yearCalifornia · 🇺🇸 United States
Posted: 11 hours agoSource: walmart.wd5.myworkdayjobs.com
Python
1KOMMA5°

Working Student, Data Engineering and Analytics Hardware

1KOMMA5°
Entryfull-time🇩🇪 Germany
Posted: 18 days agoSource: 1komma5grad.jobs.personio.com
PythonSQL
Upwork

Lead Data Engineer

Upwork
Seniorfull-time$152k–$216k / year🇺🇸 United States
Posted: 29 days agoSource: boards.greenhouse.io
PythonSQL