Grupo Protege

Solutions Engineer – Media

full-time

Location Type: Remote

Location: Brazil


About the role

  • Own data quality and curate media datasets
  • Partner with Sales and Solutions to translate customer requirements into curation strategies
  • Work with imperfect partner data, including mismatched metadata, schema differences, and incomplete labeling
  • Normalize and standardize datasets for reliable downstream use
  • Query and analyze Protege’s media catalog using SQL, internal APIs, and metadata tools to identify relevant content
  • Build validation checks and workflows to ensure dataset integrity before delivery
  • Identify, debug, and resolve data quality issues across file structures, metadata, and content alignment
  • Use AI tools and transcoded embeddings to surface and refine clip-level content
  • Turn messy, real-world data into structured datasets that meet customer and model requirements
  • Run iterative sample reviews with customers, incorporate feedback, refine selections, and ensure final packages meet spec
  • Build deep expertise in Protege’s media catalog structure, metadata, and growth patterns
  • Track content coverage, diversity, and modality mix, and identify gaps relative to customer demand
  • Partner with Product and Partnerships to share catalog insights that inform sourcing priorities
  • Work cross-functionally to ensure content packaging meets technical, ethical, and licensing requirements
  • Develop methods, scripts, and internal tools that improve curation efficiency and scale
  • Help shape Protege’s delivery platform, including how internal users and customers search, sample, and export data
  • Work closely with embedding-based systems to iterate between algorithmic selection and human review
  • Define best practices for embedding queries, relevance evaluation, and content diversity
  • Maintain a high bar for operational excellence and quality assurance throughout the process

Requirements

  • 4-7 years of experience in data science, media analytics, technical curation, or similarly hands-on data roles.
  • Strong SQL proficiency and comfort querying large, messy datasets to generate insight and action.
  • Experience working with media metadata, embeddings, or unstructured content.
  • Ability to translate nuanced customer or model requirements into concrete dataset specifications.
  • High standard for data quality, operational rigor, and usability of delivered outputs.
  • Clear communicator who can move between technical depth and customer-friendly clarity.
  • Thrives in ambiguous, fast-moving environments and treats teammates with kindness.

Benefits

  • Health insurance
  • Professional development opportunities
  • Flexible work arrangements
  • Remote work options

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

SQL, data quality, data curation, media analytics, metadata management, data normalization, data standardization, embedding-based systems, data validation, data analysis

Soft Skills

Clear communication, customer-focused, operational rigor, adaptability, team collaboration, problem-solving, attention to detail, customer requirement translation, feedback incorporation, kindness