Sumble

Data Engineer

Sumble

full-time

Posted on:

Origin:  • 🇺🇸 United States

Visit company website
AI Apply
Manual Apply

Job Level

Mid-LevelSenior

Tech Stack

CloudGoogle Cloud PlatformPostgresPythonPyTorchReactSQLTypeScript

About the role

  • Build Sumble's knowledge graph from web data for go-to-market teams, using job posts and resume data to identify org structure, tech stack, and key projects (e.g., GenAI initiatives, cloud migrations)
  • Join a 15-person team including 10 engineers with experience at Google, Meta, Stack Overflow, and Kaggle
  • Build mission-critical, flexible, and scalable data pipelines focusing on reliability, data consistency, and data recovery
  • Explore and define standards for data access and analytics patterns; develop evolving data warehouse and data lake solutions
  • Work with SQL, orchestrators, and data modeling; tech stack includes Python, FastAPI, TypeScript, GCP, PostgreSQL/AlloyDB, PyTorch/Huggingface/vLLM, Prefect, and Cloud Run
  • Tackle noisy datasets, expensive analytics computations, growing data/model complexity, and create UX supporting both aggregated and granular source data

Requirements

  • Located within Americas timezones