Focused Large Language Models are Stable Many-Shot Learners Paper • 2408.13987 • Published Aug 26, 2024
Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling Paper • 2601.21684 • Published Jan 29 • 10
On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic Workflows Paper • 2605.06110 • Published about 1 month ago • 16
Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling Paper • 2605.27030 • Published 11 days ago • 31
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling Paper • 2506.15707 • Published Oct 20, 2025
Breaking the Self-Confirming Loop: Diagnosing and Mitigating Systemic Reward Bias in Self-Rewarding RL Paper • 2510.08977 • Published Oct 10, 2025 • 1
Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling Paper • 2605.27030 • Published 11 days ago • 31
Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling Paper • 2601.21684 • Published Jan 29 • 10