
Data Infrastructure Advocate Engineer
Hugging Face
Full-time
Location Type: Remote
Location: New York • United States
About the role
- Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges.
- Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.
- Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.
- Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning.
- Experiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.
- Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
- Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
- Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
- Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.
Requirements
- Strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
- Hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
- Ability to explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
- Active participation in developer communities (GitHub, Discord, forums) and passion for open source and knowledge sharing.
- Ability to thrive in fast-moving environments and enthusiasm for building in public to inspire others.
Benefits
- Health, dental, and vision benefits for employees and their dependents.
- Parental leave.
- Flexible paid time off.
- Flexible working hours and remote options.
- Reimbursement for relevant conferences, training, and education.
- Company equity as part of compensation package.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Python, pandas, pyarrow, huggingface/datasets, Parquet, Open Table Formats, S3, deduplication, compression, data storage
Soft Skills
communication, collaboration, knowledge sharing, community engagement, writing, public speaking, creativity, problem-solving, adaptability, initiative