Papers
arxiv:2603.04750

HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel

Published on Mar 5
· Submitted by
wenjun
on Mar 9
Authors:
,

Abstract

A hierarchical multi-agent framework named HiMAP-Travel addresses long-horizon planning challenges in travel planning by decomposing tasks into strategic coordination and parallel execution, achieving superior performance over sequential and other multi-agent baselines.

AI-generated summary

Sequential LLM agents fail on long-horizon planning with hard constraints like budgets and diversity requirements. As planning progresses and context grows, these agents drift from global constraints. We propose HiMAP-Travel, a hierarchical multi-agent framework that splits planning into strategic coordination and parallel day-level execution. A Coordinator allocates resources across days, while Day Executors plan independently in parallel. Three key mechanisms enable this: a transactional monitor enforcing budget and uniqueness constraints across parallel agents, a bargaining protocol allowing agents to reject infeasible sub-goals and trigger re-planning, and a single policy trained with GRPO that powers all agents through role conditioning. On TravelPlanner, HiMAP-Travel with Qwen3-8B achieves 52.78% validation and 52.65% test Final Pass Rate (FPR). In a controlled comparison with identical model, training, and tools, it outperforms the sequential DeepTravel baseline by +8.67~pp. It also surpasses ATLAS by +17.65~pp and MTP by +10.0~pp. On FlexTravelBench multi-turn scenarios, it achieves 44.34% (2-turn) and 37.42% (3-turn) FPR while reducing latency 2.5x through parallelization.

Community

Paper author Paper submitter

This paper introduces HiMAP-Travel, an end-to-end RL framework for hierarchical multi-agent planning. By separating strategic coordination from parallel day-level execution, it mitigates constraint drift in long-horizon LLM agents and improves feasibility and efficiency on travel planning benchmarks.

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2603.04750 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.04750 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.04750 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.