NVIDIA

Senior Computer Vision, VLM Performance Engineer

NVIDIA

full-time

Posted on:

Origin:  • 🇺🇸 United States • California, Washington

Visit company website
AI Apply
Manual Apply

Salary

💰 $184,000 - $356,500 per year

Job Level

Senior

Tech Stack

CloudGRPCPythonPyTorch

About the role

  • Develop, profile and optimize inference pipelines for VLMs and other AI CV models: improve throughput and latency, data loading, pre- and post-processing.
  • Improve the efficiency of VLM models themselves: kernel optimization in CUDA
  • Upstream improvements to SDKs and libraries across NVIDIA and beyond to deliver accelerated computer vision at scale.
  • Promote high-performance AI computer vision across NVIDIA teams and functions (Engineering, Product Management, Marketing, and more).

Requirements

  • Master's of Science in Computer Science or Electrical engineering or equivalent experience.
  • 8 years practical experience or equivalent
  • Expertise in AI computer vision (VLMs, Vision Transformers, Diffusion models).
  • Proven track record using its software ecosystem (PyTorch, HuggingFace, vLLM) to develop and release production-grade software.
  • Excellent software engineering fundamentals (source control, CI/CD, testing/validation, packaging, containerization, release).
  • Proficiency with Python, C++ and CUDA (kernel optimization)
  • Experience developing cloud applications (REST APIs, gRPC).
  • Excellent written, visual, and verbal communication to present performance challenges, tradeoffs, and architectural alternatives.
  • Curiosity and drive to learn new technologies and partner across teams and functions.