CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare • Paper • 2603.24157 • Published 2 days ago • 8
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning • Paper • 2603.21289 • Published 5 days ago • 17
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? • Paper • 2603.24472 • Published 2 days ago • 35
RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models • Paper • 2603.21341 • Published 5 days ago • 23
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • Paper • 2603.20278 • Published 10 days ago • 84
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning • Paper • 2603.21065 • Published 5 days ago • 70
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model • Paper • 2603.21986 • Published 4 days ago • 110
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents • Paper • 2603.19685 • Published 7 days ago • 18
WorldAgents: Can Foundation Image Models be Agents for 3D World Models? • Paper • 2603.19708 • Published 7 days ago • 12
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents • Paper • 2603.18815 • Published 8 days ago • 14
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • Paper • 2603.19220 • Published 8 days ago • 60
AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents • Paper • 2603.16496 • Published 10 days ago • 13
Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models • Paper • 2603.15618 • Published 11 days ago • 21