Ruize Zhang

Ruize-Zhang

zrz-sh

AI & ML interests

Interested in RL

Recent Activity

upvoted a paper about 21 hours ago

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Paper • 2602.07837 • Published 4 days ago • 47

upvoted a paper 2 days ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 17

updated a dataset 6 days ago

RLinf/WideSeek-R1-test-data

Viewer • Updated 6 days ago • 200 • 15

published a dataset 6 days ago

RLinf/WideSeek-R1-test-data

Viewer • Updated 6 days ago • 200 • 15

New activity in RLinf/WideSeek-R1-4b 6 days ago

Add library_name, pipeline_tag, and arxiv metadata

#1 opened 7 days ago by nielsr

authored a paper 6 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 7 days ago • 91

upvoted a paper 7 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 7 days ago • 91

updated a collection 7 days ago

WideSeek-R1

Collection

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning • 4 items • Updated 7 days ago

updated a dataset 7 days ago

RLinf/WideSeek-R1-Corpus

Updated 6 days ago • 261

updated a model 7 days ago

RLinf/WideSeek-R1-4b

Text Generation • 4B • Updated 6 days ago • 55 • 1

liked a dataset about 2 months ago

inclusionAI/ASearcher-Local-Knowledge

Viewer • Updated Aug 6, 2025 • 45.2M • 199 • 6

liked 2 models 3 months ago

changyeon/pi0_robocasa_100demos_base_pytorch

4B • Updated Nov 17, 2025 • 1

youliangtan/gr00t-n1.5-robocasa-tabletop-posttrain

3B • Updated Sep 16, 2025 • 4 • 1

upvoted a paper 3 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 108

liked a dataset 3 months ago

gaia-benchmark/GAIA

Viewer • Updated Oct 28, 2025 • 932 • 12.4k • 609

upvoted a paper 3 months ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29, 2025 • 66

authored 4 papers 4 months ago

JuggleRL: Mastering Ball Juggling with a Quadrotor via Deep Reinforcement Learning

Paper • 2509.24892 • Published Sep 29, 2025

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

Paper • 2505.04317 • Published May 7, 2025 • 1

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Paper • 2502.01932 • Published Feb 4, 2025

OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control

Paper • 2309.12825 • Published Sep 22, 2023
