GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models Paper • 2605.29398 • Published 18 days ago • 7
LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs? Paper • 2602.16902 • Published Feb 18 • 10
diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter180 Image Feature Extraction • 8B • Updated Nov 15, 2025 • 3
diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter100 Image Feature Extraction • 8B • Updated Nov 14, 2025 • 2
diffusion-reasoning/LLaDA-8B-Instruct-wd1-acecode-iter60 Image Feature Extraction • 8B • Updated Nov 14, 2025 • 3
xiaohangt/LLaDA-8B-Instruct-wd1ucllfinal_mdpoadv-numinas_checkpoint-20 Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1
xiaohangt/LLaDA-8B-Instruct-wd1d1-maths_checkpoint-30 Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1
xiaohangt/LLaDA-8B-Instruct-wd1scl-maths_checkpoint-60 Image Feature Extraction • 8B • Updated Oct 9, 2025 • 1