Safe and Scalable Web Agent Learning via Recreated Websites • Paper • 2603.10505 • Published 21 days ago • 26
BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop • Paper • 2602.20092 • Published Feb 23 • 1
Do Vision-Language Models Respect Contextual Integrity in Location Disclosure? • Paper • 2602.05023 • Published Feb 4 • 2
HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition • Paper • 2509.24613 • Published Sep 29, 2025 • 4
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning • Paper • 2510.04072 • Published Oct 5, 2025 • 4
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation • Paper • 2502.08106 • Published Feb 12, 2025
Learning to Reason as Action Abstractions with Scalable Mid-Training RL • Paper • 2509.25810 • Published Sep 30, 2025 • 6
MI-HGNN: Morphology-Informed Heterogeneous Graph Neural Network for Legged Robot Contact Perception • Paper • 2409.11146 • Published Sep 17, 2024 • 1
Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning • Paper • 2505.20561 • Published May 26, 2025 • 7
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer • Paper • 2405.16436 • Published May 26, 2024 • 1