Xiaohang Tang

timxiaohangt

xiaohangt

AI & ML interests

Reinforcement Learning, Game Theory

Recent Activity

upvoted a paper 13 days ago

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

Paper • 2605.29398 • Published 18 days ago • 7

upvoted a paper 4 months ago

LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs?

Paper • 2602.16902 • Published Feb 18 • 10

published a model 4 months ago

timxiaohangt/Qwen2.5-1.5B-Open-R1-GRPO

Updated Feb 10

updated a model 5 months ago

timxiaohangt/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Jan 19

published a model 6 months ago

timxiaohangt/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Jan 19

updated a model 7 months ago

diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter180

Image Feature Extraction • 8B • Updated Nov 15, 2025 • 3

published a model 7 months ago

diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter180

Image Feature Extraction • 8B • Updated Nov 15, 2025 • 3

updated a model 7 months ago

diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter100

Image Feature Extraction • 8B • Updated Nov 14, 2025 • 2

published a model 7 months ago

diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter100

Image Feature Extraction • 8B • Updated Nov 14, 2025 • 2

updated a model 7 months ago

diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter60

Image Feature Extraction • 8B • Updated Nov 14, 2025 • 3

published a model 7 months ago

diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter60

Image Feature Extraction • 8B • Updated Nov 14, 2025 • 3

published 2 models 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1ucllfinal_ba-numinas_checkpoint-110

Updated Oct 9, 2025

xiaohangt/LLaDA-8B-Instruct-wd1ucllfinal_ba_1en8-numinas_checkpoint-70

Updated Oct 9, 2025

updated a model 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1ucllfinal_mdpoadv-numinas_checkpoint-20

Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1

published 2 models 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1ucllfinal_mdpoadv-numinas_checkpoint-20

Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1

xiaohangt/LLaDA-8B-Instruct-wd1ucllfinal1e6-gsm8ks_checkpoint-80

Updated Oct 9, 2025

updated a model 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1d1-maths_checkpoint-30

Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1

published a model 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1d1-maths_checkpoint-30

Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1

updated a model 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1scl-maths_checkpoint-60

Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1

published a model 8 months ago

xiaohangt/LLaDA-8B-Instruct-wd1scl-maths_checkpoint-60

Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1
