Salary
💰 $200,000 - $250,000 per year
Tech Stack
AWS, Azure, Cloud, Distributed Systems, Google Cloud Platform, Kubernetes, PyTorch, TensorFlow
About the role
- About Alluxio: Proven at global scale in production for modern AI and data services, Alluxio is the premier developer of data orchestration software. Alluxio is in production use today at eight of the top ten internet companies and seven of the ten highest-valued companies in the world. Our mission is to orchestrate data for all data-driven applications in any cloud!
- Alluxio's data orchestration platform is a meta-layer that sits between storage and compute engines, serving data to large-scale AI and analytics workloads across clusters, regions, clouds, and countries, and providing simplified access to files and objects. Features like intelligent caching, unified namespace, and data management provide agility and cost efficiency to customers in financial services, high-tech, retail, and telecommunications.
- Alluxio is trusted by Meta, Uber, Tencent, TikTok, Alibaba, Expedia, Rakuten, Microsoft, Walmart, and more! Please review Wikipedia to learn more about us! Join our world-class team of empathetic, enthusiastic, and creative people who work on some of the toughest big data problems.
- About the Role: As a Senior Product Manager, you will own the product strategy and roadmap for AI inference capabilities, collaborating with ML engineers and practitioners to deliver low‑latency, high‑throughput data access for LLMs, generative AI, computer vision, and NLP. Your focus is on features that optimize resource utilization and reduce total cost of ownership.
Requirements
- 4-8 years of experience in product management, AI infrastructure, or ML engineering roles, with at least 2 years focused on AI/ML workloads.
- Deep understanding of AI/ML workflows, including model deployment, inference optimization, and data-access patterns.
- Proven track record of delivering features with measurable improvements in latency, throughput, or GPU utilization, or equivalent experience.
- Technical proficiency in distributed systems and cloud platforms (Kubernetes, AWS/GCP/Azure) and familiarity with frameworks like PyTorch, TensorFlow, Triton Inference Server, or similar.
- Excellent communication and cross-functional leadership skills, enabling clear translation of complex technical concepts to varied audiences.