AlphaTransit: Learning to Design City-scale Transit Routes
Abstract
AlphaTransit combines Monte Carlo Tree Search with neural policy-value networks to optimize bus route design by predicting downstream quality and enabling lookahead decisions without simulator rollouts.
Designing a transit network requires many sequential route extension decisions, but their quality is often visible only after the full network is assembled. This delayed-feedback challenge lies at the heart of the Transit Route Network Design Problem (TRNDP), where route interactions can be deceptive: an extension that appears useful locally can create transfer bottlenecks, produce redundant overlap, or reduce overall throughput. To guide route construction under delayed simulator feedback, we introduce AlphaTransit, a search-based planning framework for cityscale bus network design. AlphaTransit couples Monte Carlo Tree Search (MCTS) with a neural policy-value network: the policy proposes route extensions, the value estimates downstream design quality, and search uses these predictions to refine each decision. This provides decision-time lookahead during route construction without running simulator rollouts inside the search tree. We evaluate AlphaTransit on a new Bloomington TRNDP benchmark with realistic road topology and censusderived demand, under mixed and full transit demand settings. In the Bloomington network, AlphaTransit attains the highest service rate in both demand settings, reaching 54.6% and 82.1%, respectively. Relative to reinforcement learning without search, these correspond to 9.9% and 11.4% service rate gains; relative to MCTS without learned guidance, they correspond to 2.5% and 11.2% gains. These results suggest that coupling learned guidance with MCTS is more effective than using either approach alone for transit network design. Our code and data are publicly available in https://github.com/poudel-bibek/AlphaTransit.
Community
Hi everyone, I’m one of the authors. Happy to answer questions about the method, evaluation setup, or reproducibility.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- DeCoR: Design and Control Co-Optimization for Urban Streets Using Reinforcement Learning (2026)
- Learning to Route Electric Trucks Under Operational Uncertainty (2026)
- Two-Stage Learned Decomposition for Scalable Routing on Multigraphs (2026)
- Generalising Travel Time Prediction To Varying Route Choices In Urban Networks (2026)
- Rethinking Positional Encoding for Neural Vehicle Routing (2026)
- TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation (2026)
- Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2605.28730 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 1
Datasets citing this paper 1
matrix-multiply/bloomington-tndp
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper