Hugging Face

Data Infrastructure Advocate Engineer

Hugging Face

full-time

Posted on:

Location Type: Remote

Location: New YorkUnited States

Visit company website

Explore more

AI Apply
Apply

About the role

  • Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges.
  • Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.
  • Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.
  • Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning.
  • Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.
  • Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
  • Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
  • Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
  • Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.

Requirements

  • Strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
  • Hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
  • Ability to explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
  • Active participation in developer communities (GitHub, Discord, forums) and passion for open source and knowledge sharing.
  • Thrive in fast-moving environments and enjoy building in public to inspire others.
Benefits
  • Health, dental, and vision benefits for employees and their dependents.
  • Parental leave.
  • Flexible paid time off.
  • Flexible working hours and remote options.
  • Reimbursement for relevant conferences, training, and education.
  • Company equity as part of compensation package.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Pythonpandaspyarrowhuggingface/datasetsParquetOpen Table FormatsS3deduplicationcompressiondata storage
Soft Skills
communicationcollaborationknowledge sharingcommunity engagementwritingpublic speakingcreativityproblem-solvingadaptabilityinitiative