5 10 4

Zhiheng Wang

zhwang

w-zhih

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

upvoted a paper 3 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

upvoted a paper 3 months ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

View all activity

Organizations

None yet

upvoted a paper 13 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 25 days ago • 93

upvoted 3 papers 3 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 138

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 49

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

upvoted a paper 9 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 99

liked a model 10 months ago

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 122k • • 719

upvoted a paper 10 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 212

liked a dataset about 1 year ago

TIGER-Lab/ViRL39K

Preview • Updated Apr 23, 2025 • 364 • 39

upvoted a paper about 2 years ago

To Believe or Not to Believe Your LLM

Paper • 2406.02543 • Published Jun 4, 2024 • 35

liked a Space over 2 years ago

Reward Bench Leaderboard

📐

432

Explore and compare model scores on RewardBench benchmarks

upvoted a paper over 2 years ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 65

liked a dataset over 2 years ago

liyucheng/zhihu_rlhf_3k

Viewer • Updated Apr 15, 2023 • 3.46k • 87 • 94

upvoted 2 papers over 2 years ago

Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM

Paper • 2401.02994 • Published Jan 4, 2024 • 52

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11, 2024 • 27

updated a collection over 2 years ago

RLHF

Collection

1 item • Updated Nov 28, 2023

New activity in zhwang/HPDv2 over 2 years ago

Upload SDXL-refiner-0.9.tar.gz

#4 opened over 2 years ago by

xswu

updated a dataset over 2 years ago

zhwang/HPDv2

Viewer • Updated Nov 25, 2023 • 400 • 2.19k • 14

New activity in zhwang/HPDv2 almost 3 years ago

Upload test.json

#3 opened almost 3 years ago by

xswu

Upload test.json

#2 opened almost 3 years ago by

xswu

Zhiheng Wang

AI & ML interests

Recent Activity

Organizations

zhwang's activity

Reward Bench Leaderboard

Upload SDXL-refiner-0.9.tar.gz

Upload test.json

Upload test.json