Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack Paper • 2606.14409 • Published 5 days ago • 11
Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation Paper • 2606.17030 • Published 2 days ago • 15
Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2606.15007 • Published 5 days ago • 9
VisualClaw: A Real-Time, Personalized Agent for the Physical World Paper • 2606.16295 • Published 2 days ago • 21
DreamX-World 1.0: A General-Purpose Interactive World Model Paper • 2606.16993 • Published 2 days ago • 86
FastContext: Training Efficient Repository Explorer for Coding Agents Paper • 2606.14066 • Published 5 days ago • 77
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents Paper • 2606.06036 • Published 13 days ago • 63
SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks Paper • 2606.09669 • Published 9 days ago • 42
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper • 2606.09426 • Published 9 days ago • 100
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 6 days ago • 96
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 6 days ago • 135
InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning Paper • 2606.12195 • Published 7 days ago • 22
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research Paper • 2606.09730 • Published 9 days ago • 50
VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences Paper • 2507.16443 • Published Jul 22, 2025 • 2
A Cookbook of 3D Vision: Data, Learning Paradigms, and Application Paper • 2606.04291 • Published 15 days ago • 4