One-step Latent-free Image Generation with Pixel Mean Flows Paper • 2601.22158 • Published 6 days ago • 15
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 3 days ago • 31
metaTextGrad: Automatically optimizing language model optimizers Paper • 2505.18524 • Published May 24, 2025 • 1
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning Paper • 2410.14972 • Published Oct 19, 2024 • 1
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization Paper • 2402.14528 • Published Feb 22, 2024 • 1
Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning? Paper • 2307.07837 • Published Jul 15, 2023 • 1
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective Paper • 2507.01925 • Published Jul 2, 2025 • 39