When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published 3 days ago • 107
Emergent Compositional Communication for Latent World Properties Paper • 2604.03266 • Published 25 days ago • 5
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 9 days ago • 347
EpochX: Building the Infrastructure for an Emergent Agent Civilization Paper • 2603.27304 • Published 14 days ago • 46
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published 25 days ago • 108
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 25 days ago • 248