GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models Paper • 2605.29398 • Published 18 days ago • 7
LLM-WikiRace Benchmark: How Far Can LLMs Plan over Real-World Knowledge Graphs? Paper • 2602.16902 • Published Feb 18 • 10