FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Computer Vision Researcher – VLM
Niantic Spatial, Inc.Computer Vision Researcher at Niantic Spatial developing spatial intelligence through LLMs. Leading research in multimodal AI and mentoring new researchers in London R&D hub.
Tech Stack
Tools & technologiesPyTorch
About the role
Key responsibilities & impact- Architect Semantic Grounding: Lead research into cross-modal grounding that connects 3D spatial features with language embeddings, enabling the LGM to "understand" object relationships and environmental context.
- Scale "Understand" Capabilities: Develop and deploy algorithms for continuous semantics, allowing our 3D maps to evolve and improve their situational awareness as new ground-level and aerial data is ingested.
- Agentic Frameworks: Build the "spatial brain" for Embodied AI, enabling robots, Drones and other Machines to move beyond simple navigation to mission-level reasoning.
- Multimodal Benchmarking: Define the standards for measuring "spatial common sense" in LLMs, creating evaluations that test a model’s ability to interpret and operate within complex 3D scenes.
- Technical Mentorship: Serve as the technical anchor for the London R&D hub, resolving architectural disagreements and mentoring the next generation of researchers in the fusion of 3D CV and NLP.
- Collaborative Innovation: Partner with Product leads to ensure the "Understand" API delivers high business value for enterprise customers in robotics, logistics, and field operations.
Requirements
What you’ll need- PhD (or equivalent) in Computer Vision, Machine Learning, or Robotics with a focus on Multimodal/Semantic understanding.
- 4+ years of experience in ML research, with a proven track record of shipping models that bridge 3D Vision and Language.
- Expert knowledge of 3D Geometry (SfM, SLAM, VPS) and Transformer-based architectures (VLMs).
- Multiple first-author publications at top-tier venues (CVPR, NeurIPS, ICLR) focusing on VLMs, scene understanding or semantic segmentation.
- Ability to write production-quality research code in PyTorch or JAX and manage large-scale data pipelines.
- Required In-Office Days: 3 days per week
- Experience with Gaussian Splatting or NeRFs for semantic scene representation.
- Background in robotics (ROS) or building agentic systems that interact with physical environments.
- Experience with "open-set" recognition and Zero-Shot learning.
Benefits
Comp & perks- Health insurance
- Flexible work arrangements
- Professional development opportunities
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Machine Learning3D VisionSemantic UnderstandingTransformer-Based ArchitecturesScene UnderstandingSemantic SegmentationOpen-Set RecognitionZero-Shot LearningContinuous SemanticsData Pipeline Management
Soft Skills
Technical MentorshipCollaborative Innovation