Hugging Face
Xu Zhihao
naiweizi
3 followers · 0 following
AI & ML interests
Trustworthy AI
Recent Activity
upvoted a paper • about 19 hours ago • Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
authored a paper • 8 days ago • AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
upvoted a paper • 10 days ago • AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration
Organizations
None yet
naiweizi's models (12)
naiweizi/r1-qwen-7b-sft_meta • 8B • Updated Nov 21, 2025
naiweizi/R1-Qwen-7B-SFT-Meta • Updated Nov 21, 2025
naiweizi/R1-Qwen-1_5B-Cold_Start-OpenR1_Math-priority • 2B • Updated Jul 18, 2025
naiweizi/dpo-harmless_saferlhf • Updated Jun 18, 2025
naiweizi/mistral-dpo-helpful-vanilla-1e-4 • Updated May 6, 2025
naiweizi/mistral-dpo-harmless-vanilla-2e-4 • Updated May 6, 2025
naiweizi/test • Text Generation • 8B • Updated Apr 21, 2025 • 1
naiweizi/dpo-harmless_helpful-vanilla • Updated Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo • Updated Apr 14, 2025
naiweizi/dpo-harmless_helpful-mixed • Updated Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo_mistral • Updated Apr 14, 2025
naiweizi/qwen2.5-instruct-sft_helpsteer2 • 8B • Updated Mar 14, 2025