arxiv:2603.04750

HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel

Published on Mar 5

· Submitted by

wenjun on Mar 9

Upvote

Authors:

Wenjun Li ,

Abstract

A hierarchical multi-agent framework named HiMAP-Travel addresses long-horizon planning challenges in travel planning by decomposing tasks into strategic coordination and parallel execution, achieving superior performance over sequential and other multi-agent baselines.

AI-generated summary

Sequential LLM agents fail on long-horizon planning with hard constraints like budgets and diversity requirements. As planning progresses and context grows, these agents drift from global constraints. We propose HiMAP-Travel, a hierarchical multi-agent framework that splits planning into strategic coordination and parallel day-level execution. A Coordinator allocates resources across days, while Day Executors plan independently in parallel. Three key mechanisms enable this: a transactional monitor enforcing budget and uniqueness constraints across parallel agents, a bargaining protocol allowing agents to reject infeasible sub-goals and trigger re-planning, and a single policy trained with GRPO that powers all agents through role conditioning. On TravelPlanner, HiMAP-Travel with Qwen3-8B achieves 52.78% validation and 52.65% test Final Pass Rate (FPR). In a controlled comparison with identical model, training, and tools, it outperforms the sequential DeepTravel baseline by +8.67~pp. It also surpasses ATLAS by +17.65~pp and MTP by +10.0~pp. On FlexTravelBench multi-turn scenarios, it achieves 44.34% (2-turn) and 37.42% (3-turn) FPR while reducing latency 2.5x through parallelization.

View arXiv page View PDF Add to collection

Community

wenjun-li

Paper author Paper submitter about 10 hours ago

This paper introduces HiMAP-Travel, an end-to-end RL framework for hierarchical multi-agent planning. By separating strategic coordination from parallel day-level execution, it mitigates constraint drift in long-horizon LLM agents and improves feasibility and efficiency on travel planning benchmarks.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2603.04750 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.04750 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.04750 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.