InterleaveThinker: Reinforcing Agentic Interleaved Generation Paper • 2606.13679 • Published 6 days ago • 78
RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes Paper • 2606.00828 • Published 18 days ago • 10
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 29 days ago • 189
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity • 0.1B • Updated Jan 28 • 34.4M • • 1.27k
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 28 days ago • 83
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 195
VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction Paper • 2605.15186 • Published May 14 • 26
Crowded in B-Space: Calibrating Shared Directions for LoRA Merging Paper • 2604.16826 • Published Apr 18 • 18
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated 15 days ago • 171M • • 4.95k
CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning Paper • 2604.03231 • Published Apr 3 • 7
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 633