Selective Training for Large Vision Language Models via Visual Information Gain Paper • 2602.17186 • Published 8 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training Paper • 2602.10693 • Published 16 days ago
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 18 days ago
Query-focused and Memory-aware Reranker for Long Context Processing Paper • 2602.12192 • Published 15 days ago
Is Artificial Intelligence Generated Image Detection a Solved Problem? Paper • 2505.12335 • Published May 18, 2025
ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation Paper • 2602.20093 • Published 4 days ago
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 14 days ago
SkillOrchestra: Learning to Route Agents via Skill Transfer Paper • 2602.19672 • Published 4 days ago
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning Paper • 2602.08234 • Published 18 days ago