Scoping, designing, and implementing high quality and performance numerical dense linear algebra software on GPUs. Providing technical leadership and feedback to library engineers working with you on projects and sometimes mentor interns. Working closely with product management and other internal and external customers to understand feature and performance requirements and help define the technical roadmaps of libraries. Finding opportunities to improve library performance and reduce code maintenance overhead through re-architecting.
Requirements
PhD or Master’s degree in Computer Science, Applied Math, or related science or engineering field of study (or equivalent experience). 5+ years of experience in designing, developing, testing, maintenance, and performance optimization of production software using CUDA and C++. Good knowledge of GPU (preferred) or CPU hardware architecture. Strong fundamentals in finite precision arithmetics and numerical methods for linear algebra. Great teamwork, communication, and documentation habits.