Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning • Paper • 2602.07845 • Published 4 days ago • 64
MOVA: Towards Scalable and Synchronized Video-Audio Generation • Paper • 2602.08794 • Published 3 days ago • 144
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research • Paper • 2602.06540 • Published 6 days ago • 20
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models • Paper • 2602.07026 • Published 10 days ago • 129
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining • Paper • 2602.07085 • Published 6 days ago • 177
VLS: Steering Pretrained Robot Policies via Vision-Language Models • Paper • 2602.03973 • Published 9 days ago • 22
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents • Paper • 2602.01566 • Published 11 days ago • 46