SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration • Paper • 2603.03823 • Published 1 day ago • 3
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions • Paper • 2603.03447 • Published 2 days ago • 22
Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? • Paper • 2603.03202 • Published 2 days ago • 13
DREAM: Where Visual Understanding Meets Text-to-Image Generation • Paper • 2603.02667 • Published 3 days ago • 4
PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference • Paper • 2603.02479 • Published 3 days ago • 18
Learn Hard Problems During RL with Reference Guided Fine-tuning • Paper • 2603.01223 • Published 4 days ago • 12
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation • Paper • 2602.24286 • Published 6 days ago • 74
DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model • Paper • 2602.23622 • Published 7 days ago • 3
SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching • Paper • 2602.24208 • Published 6 days ago • 7
Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators • Paper • 2602.22647 • Published 8 days ago • 3
Causal Motion Diffusion Models for Autoregressive Motion Generation • Paper • 2602.22594 • Published 8 days ago • 7
veScale-FSDP: Flexible and High-Performance FSDP at Scale • Paper • 2602.22437 • Published 8 days ago • 7