
Solutions Engineer – Media
Grupo Protege
Full-time
Location Type: Remote
Location: Brazil
About the role
- Own data quality and curate media datasets
- Partner with Sales and Solutions to translate customer requirements into curation strategies
- Work with imperfect partner data, including mismatched metadata, schema differences, and incomplete labeling
- Normalize and standardize datasets for reliable downstream use
- Query and analyze Protege’s media catalog using SQL, internal APIs, and metadata tools to identify relevant content
- Build validation checks and workflows to ensure dataset integrity before delivery
- Identify, debug, and resolve data quality issues across file structures, metadata, and content alignment
- Use AI tools and transcoded embeddings to surface and refine clip-level content
- Turn messy, real-world data into structured datasets that meet customer and model requirements
- Run iterative sample reviews with customers, incorporate feedback, refine selections, and ensure final packages meet spec
- Build deep expertise in Protege’s media catalog structure, metadata, and growth patterns
- Track content coverage, diversity, and modality mix, and identify gaps relative to customer demand
- Partner with Product and Partnerships to share catalog insights that inform sourcing priorities
- Work cross-functionally to ensure content packaging meets technical, ethical, and licensing requirements
- Develop methods, scripts, and internal tools that improve curation efficiency and scale
- Help shape Protege’s delivery platform, including how internal users and customers search, sample, and export data
- Work closely with embedding-based systems to iterate between algorithmic selection and human review
- Define best practices for embedding queries, relevance evaluation, and content diversity
- Maintain a high bar for operational excellence and quality assurance throughout the process
Requirements
- 4-7 years of experience in data science, media analytics, technical curation, or similarly hands-on data roles.
- Strong SQL proficiency and comfort querying large, messy datasets to generate insight and action.
- Experience working with media metadata, embeddings, or unstructured content.
- Ability to translate nuanced customer or model requirements into concrete dataset specifications.
- High standard for data quality, operational rigor, and usability of delivered outputs.
- Clear communicator who can move between technical depth and customer-friendly clarity.
- Thrives in ambiguous, fast-moving environments and treats teammates with kindness.
Benefits
- Health insurance
- Professional development opportunities
- Flexible work arrangements
- Remote work options
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
SQL, data quality, data curation, media analytics, metadata management, data normalization, data standardization, embedding-based systems, data validation, data analysis
Soft Skills
clear communication, customer-focused, operational rigor, adaptability, team collaboration, problem-solving, attention to detail, customer requirement translation, feedback incorporation, kindness