Hugging Face
KABI
dongguanting
60 followers · 97 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
authored a paper • about 17 hours ago • ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration
upvoted a paper • about 22 hours ago • ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration
authored a paper • 1 day ago • EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis
dongguanting's models (16)
dongguanting/Qwen3-8B-AEPO-DeepSearch • Text Generation • 8B • Updated 24 days ago • 23 • 2
dongguanting/QwQ-32B-AEPO-DeepSearch • Text Generation • 33B • Updated 24 days ago • 13 • 1
dongguanting/QwQ-32B-ARPO-DeepSearch • 33B • Updated 24 days ago • 9 • 1
dongguanting/aepo_light • 8B • Updated Nov 3, 2025 • 6
dongguanting/Qwen2.5-7B-AEPO • Text Generation • 8B • Updated Oct 27, 2025 • 15 • 1
dongguanting/Qwen3-14B-AEPO-DeepSearch • Robotics • 15B • Updated Oct 21, 2025 • 7 • 1
dongguanting/Qwen2.5-7B-ARPO • Text Generation • 8B • Updated Aug 19, 2025 • 22 • 2
dongguanting/Llama3.1-8B-ARPO • Text Generation • 8B • Updated Aug 12, 2025 • 10 • 1
dongguanting/Qwen2.5-3B-ARPO • Text Generation • 3B • Updated Aug 12, 2025 • 37 • 3
dongguanting/Qwen3-14B-ARPO-DeepSearch • Text Generation • 15B • Updated Aug 12, 2025 • 10 • 5
dongguanting/Qwen3-8B-ARPO-DeepSearch • 8B • Updated Jul 29, 2025 • 14 • 2
dongguanting/Tool-Star-Qwen-7B • Text Generation • 8B • Updated Jun 30, 2025 • 6 • 2
dongguanting/RAG-Critic-3B • Text Generation • 3B • Updated Jun 28, 2025 • 43 • 4
dongguanting/Tool-Star-Qwen-0.5B • Text Generation • 0.6B • Updated Jun 6, 2025 • 2 • 1
dongguanting/Tool-Star-Qwen-1.5B • Text Generation • 2B • Updated Jun 6, 2025 • 2
dongguanting/Tool-Star-Qwen-3B • Text Generation • 3B • Updated May 25, 2025 • 5 • 5