Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 15 days ago • 145
OpenComputer: Verifiable Software Worlds for Computer-Use Agents Paper • 2605.19769 • Published 24 days ago • 81
ChangeFlow -- Latent Rectified Flow for Change Detection in Remote Sensing Paper • 2605.15375 • Published 29 days ago • 5
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 30 days ago • 271
HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents Paper • 2605.07177 • Published May 8 • 62
SEIF: Self-Evolving Reinforcement Learning for Instruction Following Paper • 2605.07465 • Published May 8 • 30
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published May 7 • 233
Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction Paper • 2604.27221 • Published Apr 29 • 39
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 243
Experience Transfer for Multimodal LLM Agents in Minecraft Game Paper • 2604.05533 • Published Apr 7 • 16
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 327
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 506
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 632
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 352