ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution Paper • 2603.02510 • Published 2 days ago • 1
Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? Paper • 2603.03202 • Published 1 day ago • 1
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation Paper • 2602.17100 • Published 14 days ago • 2
QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs Paper • 2602.20629 • Published 9 days ago • 2
Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance Paper • 2603.02175 • Published 2 days ago • 14
PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference Paper • 2603.02479 • Published 2 days ago • 18
Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models Paper • 2603.01571 • Published 3 days ago • 28
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? Paper • 2603.03194 • Published 1 day ago • 46
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 1 day ago • 49
UniG2U-Bench: Do Unified Models Advance Multimodal Understanding? Paper • 2603.03241 • Published 1 day ago • 73
Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos Paper • 2602.23543 • Published 6 days ago • 2
SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation Paper • 2602.23359 • Published 6 days ago • 3
CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production Paper • 2603.01973 • Published 2 days ago • 5
Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training Paper • 2603.02208 • Published 2 days ago • 4