Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale Paper • 2509.24910 • Published Sep 29, 2025 • 4
VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation Paper • 2512.19021 • Published Dec 22, 2025
LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models Paper • 2603.07145 • Published Mar 7 • 4
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 26 days ago • 146
Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published Dec 6, 2025 • 7
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024 • 6
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation Paper • 2402.15852 • Published Feb 24, 2024
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models Paper • 2305.16986 • Published May 26, 2023
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models Paper • 2407.12366 • Published Jul 17, 2024 • 4