Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression Paper • 2510.01581 • Published Oct 2, 2025 • 2
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use Paper • 2603.03205 • Published Mar 3 • 13
Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems Paper • 2604.04767 • Published Apr 6 • 7
Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty Paper • 2605.11436 • Published May 12 • 1
MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems Paper • 2605.18565 • Published May 19 • 5
MINTEval: Evaluating Memory under Multi-Target Interference in Long-Horizon Agent Systems Paper • 2605.18565 • Published May 19 • 5
Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty Paper • 2605.11436 • Published May 12 • 1
Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty Paper • 2605.11436 • Published May 12 • 1