FastMix: Fast Data Mixture Optimization via Gradient Descent Paper • 2606.14971 • Published 12 days ago • 3
FastMix: Fast Data Mixture Optimization via Gradient Descent Paper • 2606.14971 • Published 12 days ago • 3
DreamOmni2: Multimodal Instruction-based Editing and Generation Paper • 2510.06679 • Published Oct 8, 2025 • 74
Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning Paper • 2510.19807 • Published Oct 22, 2025 • 1
SearchGym: Bootstrapping Real-World Search Agents via Cost-Effective and High-Fidelity Environment Simulation Paper • 2601.14615 • Published Jan 21 • 1
SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration Paper • 2510.19767 • Published Oct 22, 2025 • 1
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation Paper • 2507.08441 • Published Jul 11, 2025 • 63