
Staff Data Engineer
VirtusLab
full-time
Location Type: Remote
Location: New York • United States
About the role
- Build web crawling or large-scale data systems from scratch
- Design scalable, fault-tolerant distributed systems
- Lead complex technical initiatives
- Mentor engineers and promote a collaborative culture
- Operate ETL/ELT pipelines
- Extract structured and unstructured web data
Requirements
- Proven experience building web crawling or large-scale data systems from scratch
- Strong architectural skills in designing scalable, fault-tolerant distributed systems
- Track record of leading complex technical initiatives and driving architecture direction for teams
- Demonstrated ability to evolve production systems incrementally while maintaining reliability
- Experience mentoring engineers at all levels and promoting a collaborative culture
- Deep background in large-scale data engineering (terabytes daily)
- Hands-on experience with cloud data warehouses (BigQuery, Snowflake)
- Experience with Apache Kafka, Kubernetes (GKE/EKS), and orchestration tools (Airflow)
- Familiarity with multi-cloud environments (GCP + AWS)
- Expertise in designing and operating ETL/ELT pipelines
- Deep expertise in web crawling technologies and advanced scraping (Scrapy or similar)
- Experience in extracting structured/unstructured web data and SERP extraction
- Knowledge of proxy infrastructure management, anti-bot detection, and ethical crawling
- Familiarity with crawling vendors and AI/LLM-based extraction approaches
- Support the VirtusLab U.S. and international teams by lending senior technical expertise to client-facing activities, including technical discovery sessions, workshops, and solution architecture
- Conduct requirements analysis and solution discovery, identifying business and technical needs
- Provide technical consulting and advisory services, recommending appropriate data architectures aligned with customer goals
- Prepare and review technical sections of commercial offers, including solution descriptions, statements of work (SoWs), project estimates, timelines, and delivery models
Benefits
- Self-development opportunities
- Good working conditions
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
web crawling, large-scale data systems, scalable systems design, fault-tolerant systems, ETL pipelines, ELT pipelines, data engineering, scraping technologies, Apache Kafka, Kubernetes
Soft Skills
leadership, mentoring, collaboration, technical consulting, requirements analysis, solution discovery, communication, advisory services, architectural direction, project management