d-Matrix

Software Engineering, Senior Director - Kernels

full-time

Origin: 🇺🇸 United States • California

Salary

💰 $180,000 - $300,000 per year

Job Level

Senior

Tech Stack

Cloud, Linux, Python, PyTorch, TensorFlow

About the role

  • You will be part of the team that productizes the software stack for our AI compute engine.
  • As part of the Software team, you will be responsible for the development, enhancement, and maintenance of software kernels for next-generation AI hardware.
  • You have experience building software kernels for hardware architectures and mapping algorithms to those architectures.
  • You understand how to map computational graphs generated by AI frameworks to the underlying architecture.
  • You have worked across all aspects of the full-stack toolchain and understand what it takes to optimize and trade off the various aspects of hardware-software co-design.
  • You are able to build and scale software deliverables in a tight development window.
  • You will work with a team of compiler experts to build out the compiler infrastructure, collaborating closely with other software (ML, Systems) and hardware (mixed signal, DSP, CPU) experts in the company.

Requirements

  • MS or PhD in Computer Engineering, Math, Physics, or a related field, with 12+ years of industry experience.
  • Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals.
  • Proficient in C/C++ and Python development in a Linux environment using standard development tools.
  • Experience implementing algorithms in high-level languages such as C/C++ and Python.
  • Experience implementing algorithms for specialized hardware such as FPGAs, DSPs, GPUs, and AI accelerators, using libraries such as CUDA.
  • Experience implementing operators commonly used in ML workloads: GEMMs, convolutions, BLAS, and SIMD operators for operations such as softmax, layer normalization, and pooling.
  • Experience with development for embedded SIMD vector processors such as Tensilica.
  • Self-motivated team player with a strong sense of ownership and leadership.
  • Prior startup, small team or incubation experience.
  • Experience with ML frameworks such as TensorFlow and/or PyTorch.
  • Experience working with ML compilers and algorithms, such as MLIR, LLVM, TVM, and Glow.
  • Experience with a deep learning framework (such as PyTorch or TensorFlow) and ML models for CV, NLP, or recommendation.
  • Work experience at a cloud provider or AI compute / sub-system company.