FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Software Engineer, Labelling, Data & Automation
WaabiSoftware Engineer developing tools and pipelines for Waabi's AI-focused self-driving technology. Collaborating with teams to create reliable data solutions and supporting machine learning projects.
Tech Stack
Tools & technologiesAirflowApacheAWSCloudHadoopMapReducePythonSparkTerraform
About the role
Key responsibilities & impact- Design and implement tools, pipelines, and metrics to accelerate the development of our AI-first autonomy system and generative AI simulator.
- Own the process, criteria, and tooling for efficiently finding interesting and relevant data across the petabytes of real world data that Waabi has collected.
- Build high reliability systems for extracting and labelling the interesting data with various vendors and integrate it back into our system.
- Work with both internal and third party stakeholders to define taxonomy, validation rules and success criteria for our labelling projects.
- Design and manage the end-to-end deployment of data solutions to deliver high quality labelled data for various ML teams to use in experiments and model improvement.
- Deploy open-set / embedding models to our production environment - empowering new search and curation modalities.
- Champion engineering excellence, ensuring high-quality, well structured, and tested code.
- Contribute to project roadmap planning, prioritization, and delivery.
Requirements
What you’ll need- 4+ years of industry experience.
- Bachelor's in computer science, engineering, machine learning, or a related technical discipline.
- Proficient in Python programming and strong software engineering fundamentals with real-world experience writing high quality, well-structured, and well-tested code.
- A willingness and ability to learn new skills, technologies, and software libraries as required.
- Strong experience with data pipelines for large-scale processing and analysis.
- Strong communication and organizational skills.
- Understanding of cloud job orchestration, monitoring, and instrumentation best-practices.
- Open-minded and collaborative team player with the willingness to help others.
- Passionate about self-driving technologies, solving hard problems, and creating innovative solutions.
- Bonus Points:
- Experience with ML pipelines, including dataset curation, labelling, training and evaluation.
- Previous experience in self-driving technology or related fields.
- Familiar with linear algebra (projections, transforms) and 3D geometry.
- Experience with MapReduce frameworks (Apache Hadoop/Spark) or orchestration frameworks (Apache Airflow/Apache Beam/Google Dataflow/AWS Step Functions).
- Experience with with front end development.
- Experience working with open-set / embedding models and deploying them in a production setting.
- Experience working with infra as code (Terraform, CloudFormation, etc).
Benefits
Comp & perks- Competitive compensation and equity awards.
- Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).
- Unlimited Vacation.
- Flexible hours and Work from Home support.
- Daily drinks, snacks and catered meals (when in office).
- Regularly scheduled team building activities and social events both on-site, off-site & virtually.
- As we grow, this list continues to evolve!
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Pythondata pipelinesML pipelinesMapReduce frameworksApache HadoopApache SparkApache AirflowApache BeamGoogle DataflowAWS Step Functions
Soft Skills
strong communication skillsorganizational skillscollaborative team playerwillingness to learnproblem-solvingengineering excellence
Certifications
Bachelor's in computer scienceBachelor's in engineeringBachelor's in machine learning