About the role
- 将语言模型转化为现实应用,参与构建面向全球用户的 AI 系统
- 收集、清洗与预处理用户生成的文本与图像数据,用于大模型微调
- 设计并管理可扩展的数据标注流程,结合众包平台与内部标注团队
- 构建与维护用于内容审核的自动化数据集(例如:安全与不安全内容的识别)
- 与研究员及工程师协作,确保数据集具有高质量、多样性,并对齐模型训练目标
- Fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production
- Collaborate with regional teams across product, engineering, operations, infrastructure and data
- Ensure datasets support model safety, trustworthiness, and scalability
Requirements
- 有为机器学习或大模型微调准备数据集的成熟经验
- 精通文本与图像数据的清洗、预处理与转换流程
- 熟悉数据标注工作流,并能对标注数据进行质量控制与管理
- 熟悉内容审核类数据集的构建与维护(例如安全性、合规性、过滤机制)
- 精通脚本编程(如 Python、SQL),并能处理大规模数据管道
- Proven experience preparing datasets for machine learning or fine-tuning large models
- Strong skills in data cleaning, preprocessing, and transformation for both text and image data
- Hands-on experience with data labeling workflows and quality assurance for labeled data
- Familiarity with building and maintaining moderation datasets (safety, compliance, and filtering)
- Proficiency in scripting (Python, SQL) and working with large-scale data pipelines
- 扁平化组织架构与真实项目主导权
- 全程参与产品方向制定与共识决策过程
- 灵活办公制度
- 高影响力岗位,跨产品、数据与工程多团队协作
- 顶尖市场薪酬与绩效奖金制度
- 全球化产品开发机会
- 丰厚福利:住房补贴、高质量公司食堂、加班餐补
- 健康、牙科与视力保险
- 全球差旅保险(适用于你与家属)
- 无限制、弹性带薪休假
- Flat structure & real ownership
- Full involvement in direction and consensus decision making
- Flexibility in work arrangement
- High-impact role with visibility across product, data, and engineering
- Top-of-market compensation and performance-based bonuses
- Global exposure to product development
- Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
- Health, dental & vision insurance
- Global travel insurance (for you & your dependents)
- Unlimited, flexible time off
ATS Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
machine learningdataset preparationdata cleaningdata preprocessingdata transformationdata labeling workflowsquality assurancescriptingPythonSQL
Soft skills
collaborationproject managementcommunication