Why This Matters

Transit agencies increasingly need to leverage large-scale multimodal sensor data to optimize operations and support decision-making, but lack standardized frameworks for collecting, storing, and processing such data efficiently. Traditional approaches often fail to handle the high-volume, heterogeneous nature of real-world transit data streams. This work is innovative because it provides a practical, cloud-based architecture that enables transit agencies to build machine learning applications using real-world data while maintaining the flexibility to adapt systems as new requirements emerge.

What We Did

This paper develops efficient data management and processing frameworks for intelligent urban mobility systems using real-world transit data from Chattanooga. The work presents an integrated data architecture combining real-time vehicle telemetry, weather data, traffic information, and elevation maps to support machine learning applications. The framework addresses challenges in storing high-velocity data streams and enabling both offline model training and real-time inference for transit applications including energy prediction and ridership forecasting.

Key Results

The framework successfully demonstrates integration of multiple data sources into a unified system supporting both offline training and real-time inference. Results show that the architecture enables effective machine learning-based energy prediction for electric vehicles and transit optimization applications. The system has been deployed with the Chattanooga Area Regional Transportation Authority and provides the foundation for multiple AI applications including occupancy prediction and route-level energy consumption estimation.

Full Abstract

Cite This Paper

@inproceedings{wilbur21,
  author = {Wilbur, Michael and Pugliese, Philip and Laszka, Aron and Dubey, Abhishek},
  booktitle = {Proceedings of the Workshop on AI for Urban Mobility at the 35th AAAI Conference on Artificial Intelligence (AAAI-21)},
  title = {Efficient Data Management for Intelligent Urban Mobility Systems},
  year = {2021},
  abstract = {Modern intelligent urban mobility applications are underpinned by large-scale, multivariate, spatiotemporal data streams. Working with this data presents unique challenges of data management, processing and presentation that is often overlooked by researchers. Therefore, in this work we present an integrated data management and processing framework for intelligent urban mobility systems currently in use by our partner transit agencies. We discuss the available data sources and outline our cloud-centric data management and stream processing architecture built upon open-source publish-subscribe and NoSQL data stores. We then describe our data-integrity monitoring methods. We then present a set of visualization dashboards designed for our transit agency partners. Lastly, we discuss how these tools are currently being used for AI-driven urban mobility applications that use these tools.},
  contribution = {colab},
  tag = {ai4cps,transit},
  keywords = {intelligent transportation systems, data management, machine learning, real-time systems, energy prediction, transit optimization, data architecture}
}
Quick Info
Year 2021
Keywords
intelligent transportation systems data management machine learning real-time systems energy prediction transit optimization data architecture
Research Areas
transit energy ML for CPS middleware
Search Tags

Efficient, Data, Management, Intelligent, Urban, Mobility, Systems, intelligent transportation systems, data management, machine learning, real-time systems, energy prediction, transit optimization, data architecture, transit, energy, ML for CPS, middleware, 2021, Wilbur, Pugliese, Laszka, Dubey