Salary
💰 $94,000 - $104,000 per year
About the role
- Collaborate with data scientists to prepare, structure, and format raw documents for AI model training.
- Perform document anonymization and redaction to ensure compliance with data privacy regulations (e.g., GDPR, CCPA).
- Conduct document labeling and tagging for supervised learning datasets.
- Convert unstructured documents (PDFs, Word, scanned content) into structured formats (e.g., JSON, tables).
- Analyze outputs from LLMs and other models, comparing them against ground truth.
- Conduct manual validation of AI-generated results, ensuring quality and business relevance.
- Track model accuracy, identify gaps, and provide insights on functional alignment.
- Prepare documentation, test cases, and assist with knowledge base creation.
Requirements
- Bachelor's degree or above in Computer Science, Information Systems, Business Analytics, or related fields.
- Experience working in AI/ML or data science project environments.
- Understanding of machine learning pipelines and supervised learning concepts.
- Prior experience in model evaluation or QA for AI outputs is a strong plus.
- Fluency in Mandarin is a strong plus (for document processing and team collaboration).
- Experience working in Agile teams or fast-paced environments.
- 15 days per year of Paid Time Off (PTO)
- 8 paid holidays + 1 personal floating holiday
- 401(k) retirement plan with company match.
- Medical, dental, and vision insurance
- Health savings account (HSA)
- Short-term and long-term disability
- Employee assistance plan (EAP)
- Basic life and AD&D insurance
- Health care flexible spending account
- Dependent care flexible spending account
- Commuter benefits
- Voluntary accident & critical injury coverage
- Voluntary long-term care coverage
- Voluntary life and AD&D insurance
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
document anonymizationdocument redactiondocument labelingdocument taggingdata privacy regulationsmachine learning pipelinessupervised learningmodel evaluationquality assurancedata formatting
Soft skills
collaborationanalytical skillsattention to detailcommunicationproblem-solvingadaptabilityteamworkinsight generationdocumentation skillsvalidation skills