ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 10 days ago • 255
Structured Distillation of Web Agent Capabilities Enables Generalization Paper • 2604.07776 • Published 10 days ago • 20
VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification Paper • 2604.01569 • Published 17 days ago • 13
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 19 days ago • 340
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 24 days ago • 26
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 29 days ago • 338