Salary
💰 $250,000 - $400,000 per year
About the role
- Help build new multimodal AI systems for media generation
- Improve performance of model training and inference
- Exercise strong low-level understanding of GPU workloads in a fast-paced, high-ownership environment
- Optimize state-of-the-art AI models for video generation, such as Gen-3 and other video models
- Build tooling to improve the efficiency and reliability of distributed training runs on Runway’s HPC cluster
- Telecommute from anywhere in the US with occasional travel to HQ
Requirements
- 5 years of experience in the job offered or similar software engineering position
- 5 years of experience in a role optimizing machine learning model inference and training
- 5 years of experience with Python, C/C++, CUDA
- 5 years of experience profiling GPU performance and distributed training runs
- Experience with ML framework (such as PyTorch), optimized runtimes for inference (such as TensorRT) or compilers (such as GCC)
- Bachelor’s degree in Computer Science, Mathematics, Applied Mathematics, or a closely related field, or foreign degree equivalent