Salary
💰 $184,000 - $356,500 per year
Tech Stack
CloudGRPCPythonPyTorch
About the role
- Develop, profile and optimize data-center and edge computer vision workloads for efficiency, latency, and throughput (Python)
- Implement and improve computer vision and image processing algorithms using CUDA
- Upstream performance improvements to SDKs and libraries across NVIDIA to deliver accelerated computer vision at scale
- Influence software architecture, validation strategy and technical roadmaps to ensure outstanding performance
- Promote high-performance computer vision across NVIDIA teams and functions (Engineering, Product Management, Marketing, and more)
Requirements
- Master's of Science in Computer Science or Electrical engineering or equivalent experience
- 8 years of practical experience
- Excellent software engineering fundamentals (source control, CI/CD, testing/validation, packaging, containerization, release)
- Proven track record developing, testing and releasing production-grade, complex software
- Proficiency with Python, CUDA and C++
- Strong fundamentals with multi-threaded, multi-process and distributed software development
- Expertise defining and driving performance metrics through profiling and benchmarking
- Experience developing performance-critical data center and cloud applications (REST APIs, gRPC)
- Excellent written, visual, and verbal communication
- Curiosity and drive to learn new technologies and partner across teams and functions
- (Nice-to-have) Expertise in classical, non-ML computer vision
- (Nice-to-have) Expertise in ML computer vision (VLMs, Vision Transformers, Diffusion models) and ecosystem: PyTorch, HuggingFace, vLLM
- (Nice-to-have) Grounding in linear algebra, numerical methods, statistics, and exploratory data analysis
- (Nice-to-have) History of creativity and innovation around performance in multiple problem domains