Data Architectures for AI-Ready Interoperable Public Transportation Ecosystems
Abstract
Public transportation (PT) agencies generate vast amounts of heterogeneous data from automatic fare collection (AFC), automatic passenger counting (APC), vehicle location (AVL/CAD), schedule and real-time feeds (GTFS/GTFS-RT), and proprietary platforms. These datasets offer unprecedented opportunities for data-driven planning, operations, and passenger services, but their potential is constrained by fragmentation, inconsistent update frequencies, and the lack of reproducible, interoperable pipelines. While contemporary data platform patterns and architectural styles from enterprise computing address analogous challenges in other sectors, their adaptation to the PT domain remains mostly underexplored. Transit systems present unique conditions, including the convergence of Information Technology (IT) and Operational Technology (OT), long asset lifecycles, rigorous security requirements, multi-agency coordination requirements, and the need to operate on live systems that preclude controlled experimentation.
Get this paper in your agent:
hf papers read 2606.00057 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper