Tech Stack
Tools & technologies: Distributed Systems, Go, Java, JavaScript, Python, Rust
About the role
Key responsibilities & impact
- Build and maintain large-scale web crawlers across diverse domains
- Design high-throughput, fault-tolerant systems for data collection (millions to billions of URLs/day)
- Handle anti-bot systems, rate limits, and dynamic/JS-heavy sites
- Develop pipelines for cleaning, deduplication, filtering, and normalization
- Construct and maintain datasets for research and model training
- Monitor crawl performance, coverage, and data quality; iterate quickly
- Collaborate with research teams to align data collection with modeling needs
- Optimize infrastructure for cost, latency, and reliability
Requirements
What you’ll need
- Strong programming experience in one or more of: Go, Rust, Python, Java, or C++
- Experience building web crawlers or large-scale data pipelines
- Solid understanding of HTTP, networking, and browser behavior
- Familiarity with distributed systems and parallel processing
- Experience working with large datasets (TB–PB scale preferred)
- Ability to debug unstable or adversarial environments
Benefits
Comp & perks
- Competitive salary
- Benefits and equity package
