arxiv:2410.09724
Banghua Zhu
banghua
AI & ML interests
Foundation models, reinforcement learning, statistics, information theory
Recent Activity
published an article 18 days ago
The Open Source Community is backing OpenEnv for Agentic RL liked a dataset 7 months ago
nvidia/Nemotron-RL-math-OpenMathReasoning updated a dataset 7 months ago
nvidia/Nemotron-RL-math-OpenMathReasoning