🏝️ On Vacation

xuxin

xx18

·

https://xinxu-ustc.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

upvoted a paper about 2 months ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

new activity 2 months ago

xx18/Composition-RL-4B-Depth1_2_3:Add model card and metadata

View all activity

Organizations

Collections 2

Papers 19

arxiv:2605.11739

arxiv:2603.05369

arxiv:2602.12036

arxiv:2511.15248

models 23

xx18/Composition-RL-4B-Depth1_2_3

Text Generation • 4B • Updated Apr 27 • 13

xx18/Composition-RL-4B-Depth1_2

Text Generation • 4B • Updated Apr 27 • 4

xx18/Baseline-4B-MATH12K

Text Generation • 4B • Updated Apr 27 • 27

xx18/Composition-RL-4B-Physics_Math

Text Generation • 4B • Updated Apr 27 • 4

xx18/Composition-RL-30B-A3B

Text Generation • 31B • Updated Apr 27 • 14

xx18/Composition-RL-14B

Text Generation • 15B • Updated Apr 27 • 8

xx18/Composition-RL-8B

Text Generation • 8B • Updated Apr 27 • 13 • 1

xx18/Composition-RL-4B

Text Generation • 4B • Updated Apr 27 • 22

xx18/TFPI-Qwen3-4B-Thinking-2507-Stage3

Text Generation • 4B • Updated Feb 12 • 12

xx18/DirectRL_Qwen3-4B_baseline2

Text Generation • 4B • Updated Feb 12 • 12

datasets 7

xx18/MATH-Composition-Depth3

Viewer • Updated Apr 27 • 132k • 29

xx18/Physics-MATH-Composition-141K

Viewer • Updated Apr 27 • 141k • 36

xx18/MATH-Composition-199K

Viewer • Updated Apr 27 • 199k • 24 • 1

xx18/Composition-RL-EVA

Viewer • Updated Apr 27 • 12.8k • 36 • 1

xx18/Polaris-Composition-1323K

Viewer • Updated Apr 27 • 1.32M • 33 • 1

xx18/TFPI-EVA

Preview • Updated Sep 28, 2025 • 18 • 1

xx18/R2PE

Viewer • Updated Feb 21, 2024 • 38.7k • 87 • 2