Data Scientist

• Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users.
• You will develop and optimize RL models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning.
• You will explore and evaluate advanced algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance.
• The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the engineering skills to build scalable production systems.
• You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking.
• Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Data Scientist, Reinforcement Learning

Job Level

Tech Stack

About the role

Requirements