
AI Research Engineer – Vision AI, VLM, Physical AI
Thermo Fisher Scientific
full-time
Posted on:
Location Type: Remote
Location: Washington • United States
Visit company websiteExplore more
Salary
💰 $140,000 - $150,000 per year
Tech Stack
About the role
- Advance Visual Perception: Build and fine‑tune models for detection, tracking, segmentation (2D/3D), pose & activity recognition, and scene understanding (incl. 360° and multi‑view).
- Multimodal Reasoning with VLMs: Train/evaluate vision–language models (VLMs) for grounding, dense captioning, temporal QA, and tool use; design retrieval-augmented and agentic loops for perception-action tasks.
- Physical AI & Embodiment: Prototype perception‑in‑the‑loop policies that close the gap from pixels to actions (simulation + real data). Integrate with planners and task graphs for manipulation, navigation, or safety workflows.
- Data & Evaluation at Scale: Curate datasets, author high-signal evaluation protocols/KPIs, and run ablations that make results irreproducible impossible.
- Systems & Deployment: Package research into reliable services on a modern stack (Kubernetes, Docker, Ray, FastAPI), with profiling, telemetry, and CI for reproducible science.
- Agentic Workflows: Orchestrate multi-agent pipelines (e.g., ‑LangGraphstyle graphs) that combine perception, reasoning, simulation, and ‑code generation to ‑selfcheck and ‑selfcorrect.
Requirements
- Masters/Ph.D in CS/EE/Robotics (or related), actively publishing in CV/ML/Robotics (e.g., CVPR/ICCV/ECCV, NeurIPS/ICML/ICLR, CoRL/RSS).
- Strong PyTorch (or JAX) and Python; comfort with CUDA profiling and mixed precision training.
- Demonstrated research in computer vision and at least one of: VLMs (e.g., LLaVA style, video-language-models), embodied/physical AI, 3D perception.
- Proven ability to move from paper → code → ablation → result with rigorous experiment tracking.
Benefits
- Real impact: Your research ships—powering core features in our MVPs and products.
- Mentorship: Work closely with our Principal Architect and senior engineers/researchers.
- Velocity + Rigor: We balance top‑tier research practices with pragmatic product focus.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
computer visionmultimodal reasoningvision-language modelsdetectiontrackingsegmentationpose recognitionactivity recognitionPythonPyTorch
Soft Skills
research abilityexperiment trackingproblem-solvingcollaboration
Certifications
Masters in Computer SciencePh.D in Computer ScienceMasters in Electrical EngineeringPh.D in Electrical EngineeringMasters in RoboticsPh.D in Robotics