Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning Paper • 2606.24133 • Published 3 days ago • 5
ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection Paper • 2606.24112 • Published 3 days ago • 3
AC-ODM: Actor--Critic Online Data Mixing for Sample-Efficient LLM Pretraining Paper • 2505.23878 • Published 12 days ago • 1
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation Paper • 2602.01756 • Published Feb 2 • 23