HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness Paper • 2606.12882 • Published 4 days ago • 10
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published Feb 25 • 26
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models Paper • 2307.10635 • Published Jul 20, 2023 • 9