EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning Paper • 2606.03108 • Published 11 days ago • 10
Breaking the Bubble: Asynchronous Pipeline Parallel Training with Bounded Weight Inconsistency Paper • 2606.07881 • Published 8 days ago • 7
Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models Paper • 2606.11324 • Published 4 days ago • 10
Verifiable Environments Are LEGO Bricks: Recursive Composition for Reasoning Generalization Paper • 2606.12373 • Published 3 days ago • 7
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 3 days ago • 7
POISE: Position-Aware Undetectable Skill Injection on LLM Agents Paper • 2606.07943 • Published 7 days ago • 4
Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training Paper • 2606.11854 • Published 3 days ago • 3
FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching Paper • 2601.05212 • Published 5 days ago • 1
τ-Rec: A Verifiable Benchmark for Agentic Recommender Systems Paper • 2606.10156 • Published 5 days ago • 1
DRIFT: A Residual Flow Adapter for Decoding Continuous Outputs in Vision-Language Models Paper • 2606.05758 • Published 9 days ago • 5
Lius: Translation Model Based Instructional Lingustic Using Continual Instruction Tuning In Kupang Malay Paper • 2606.11786 • Published 3 days ago • 2
Adaptive Multi-Resolution Procedural Knowledge Compression for Large Language Models Paper • 2606.12203 • Published 3 days ago • 2
Time-Series Foundation Model Embeddings for Remaining Useful Life Estimation Paper • 2606.11990 • Published 3 days ago • 3
Large Language Models Are Overconfident in Their Own Responses Paper • 2606.03437 • Published 11 days ago • 3
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models Paper • 2606.11289 • Published 4 days ago • 8
Reason, Then Re-reason: Cross-view Revisiting Improves Spatial Reasoning Paper • 2606.11683 • Published 3 days ago • 28