Dan Zhang

zd21

https://zhangdan0602.github.io/

ZhangDa57152861
zhangdan0602

AI & ML interests

None yet

Organizations

None yet

zd21's collections (1)

Learning Smooth Reward Models with Temporal Difference for LLM RL and Inference

zd21/DeepSeek-TD0-PRM

Updated Jul 12, 2025
zd21/DeepSeek-TD2-PRM

Updated Jul 12, 2025
zd21/DeepSeek-ScalarPRM

Updated Jul 12, 2025
zd21/DeepSeek-ScalarORM

Updated Jul 12, 2025
