LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls Paper • 2511.09148 • Published Nov 12, 2025 • 18
Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback Paper • 2512.22336 • Published Dec 26, 2025 • 2
TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning Paper • 2601.04698 • Published Jan 8, 2026 • 10
Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards Paper • 2510.07774 • Published Oct 9, 2025
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published Jan 8, 2026 • 169
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench Paper • 2310.01386 • Published Oct 2, 2023
Exploring Human-Like Translation Strategy with Large Language Models Paper • 2305.04118 • Published May 6, 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models Paper • 2310.20499 • Published Oct 31, 2023 • 8
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate Paper • 2305.19118 • Published May 30, 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher Paper • 2308.06463 • Published Aug 12, 2023 • 1
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments Paper • 2403.11807 • Published Mar 18, 2024