Design and scale the infrastructure that powers large-scale multimodal training and evaluation
Manage distributed data pipelines and harden pipelines powering Sora’s rapid iteration cycles
Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security
Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient
Partner with researchers to deeply understand requirements and translate them into production-ready systems
Harden, optimize, and maintain critical data infrastructure systems that power multimodal training and evaluation
Requirements
Strong experience with distributed systems and large-scale infrastructure with a strong interest in data
Detail-oriented and bring rigor to building and maintaining reliable systems
Excellent software engineering fundamentals and organizational skills
Comfortable with ambiguity and rapid change
Ability to partner with researchers and translate requirements into production-ready systems
Experience designing, building, and maintaining data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure