Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
4
wang
wzx111
Follow
AI & ML interests
None yet
Organizations
None yet
spaces
2
Sort: Recently updated
pinned
Sleeping
My Argilla
✍
好
Runtime error
Chatweb
📊
models
10
Sort: Recently updated
wzx111/14B-Aggressive-OPO-Delta-LR2e-6-G32
Updated
Mar 8
wzx111/14B-Aggressive-GSPO-LR2e-6-G32
Updated
Mar 8
wzx111/Qwen3-1.7B-GRPO-math
Updated
Nov 29, 2025
wzx111/Qwen3-1.7B-Open-R1-ADPO
Text Generation
•
2B
•
Updated
Nov 23, 2025
•
4
wzx111/Qwen3-1.7B-Open-R1-GRPO-Baseline
Text Generation
•
2B
•
Updated
Nov 22, 2025
•
5
wzx111/Qwen3-1.7B-Open-R1-GRPO
2B
•
Updated
May 14, 2025
wzx111/Qwen3-1.7B-Open-R1-GDPO-epcoh_
Text Generation
•
2B
•
Updated
May 14, 2025
•
3
wzx111/Qwen3-1.7B-MATH-GDPO-EPOCH2
Text Generation
•
2B
•
Updated
May 2, 2025
•
3
wzx111/Qwen3-1.7B-MATH-GDPO
Text Generation
•
2B
•
Updated
May 1, 2025
•
53
•
2
wzx111/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
Apr 28, 2025
•
18
datasets
3
Sort: Recently updated
wzx111/MATH-lighteval-level3
Viewer
•
Updated
Dec 9, 2025
•
2.72k
•
14
wzx111/MATH-lighteval-level-middlehigh
Viewer
•
Updated
Nov 24, 2025
•
5.63k
•
11
wzx111/MATH-lighteval-level-middle
Viewer
•
Updated
Nov 24, 2025
•
7.87k
•
21