new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Mar 26

Submitted by

taesiri

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

·
7 authors

Submitted by

beanie00

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

MicrosoftResearch

Microsoft Research

Submitted by

jaewon040

4DGS360: 360° Gaussian Reconstruction of Dynamic Objects from a Single Video

SeoulNatlUniv

Seoul National University

1

Submitted by

zx-Wu

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

OPPOer

Submitted by

Seanie-lee

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

kaist-ai

Submitted by

Agcs12

CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare

MBZUAI

Mohamed Bin Zayed University of Artificial Intelligence

Submitted by

YanAdjeNole

Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments

TheFinAI

Submitted by

Mercury7353

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

·
9 authors

Submitted by

taesiri

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

·
12 authors

Submitted by

taesiri

Toward Physically Consistent Driving Video World Models under Challenging Trajectories

·
13 authors

Submitted by

taesiri

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

·
14 authors

Submitted by

taesiri

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

·
8 authors

Submitted by

garrying

UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation

·
2 authors

2

Submitted by

fromthesky

PLDR-LLMs Reason At Self-Organized Criticality

FromtheskyResearchLabs

Fromthesky Research Labs