Predicting the Unpredictable: A Reproducible Framework with Open Multi-Source Data for Irregular Non-commuting OD Flows
Hongmeng Cui , Bingfeng Si , Yueqing Li , Jingwen Xue , Dazhuang Chi
Urban Rail Transit ›› 2026, Vol. 12 ›› Issue (1) : 91 -119.
Real-time prediction of dynamic origin–destination (OD) passenger flows is essential for efficient passenger flow management in urban rail transit (URT) systems. Existing studies have primarily focused on commuting OD flows, which exhibit strong regularity and are supported by abundant data samples. In contrast, non-commuting OD flows—especially those generated by irregular passengers with limited historical data—are characterized by high stochasticity and data sparsity and have received relatively little attention, with existing studies often reporting unsatisfactory predictive performance. To address these challenges, this study proposes a novel real-time OD flow prediction framework for irregular non-commuting passengers through multi-source data fusion and feature extraction. Specifically, individual-level spatiotemporal behavioral features are extracted from metro AFC data using a density-based clustering algorithm. Land-use and geo-economic data are then integrated to characterize individual travel preferences and construct a multidimensional behavioral indicator system. Building upon these features, hierarchical clustering and machine learning models are employed to perform personalized destination prediction. Empirical experiments conducted on Nanjing Metro data demonstrate that the proposed framework substantially improves prediction accuracy for non-commuting passengers and provides new insights into dynamic OD modeling. The results highlight the strong applicability and potential of the method for real-time passenger flow prediction in complex urban rail systems.
Urban rail transit / Real-time OD passenger flow prediction / Non-commuting passengers / Multi-source data fusion / Machine learning
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
Jiang J, Xu Y, He S, et al. (2020) Predication of the urban rail transit commuter flows in long trip chains. In: COTA international conference of transportation professionals |
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. https://doi.org/10.48550/ARXIV.1609.02907 |
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
Li W, Sui L, Zhou M, et al (2021) Short-term passenger flow forecast for urban rail transit based on multi-source data. EURASIP J Wire Commun Netw 2021(1). https://doi.org/10.1186/s13638-020-01881-4 |
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
Rubinstein RY, Kroese DP (2016) Simulation and the Monte Carlo Method. Wiley. https://doi.org/10.1002/9781118631980 |
| [37] |
|
| [38] |
Scornet E, Biau G, Vert JP (2015) Consistency of random forests. The Annals of Statistics 43(4). https://doi.org/10.1214/15-aos1321 |
| [39] |
Shi X, Chen Z, Wang H, et al (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. https://doi.org/10.48550/ARXIV.1506.04214 |
| [40] |
Shi X, Gao Z, Lausen L, et al (2017) Deep learning for precipitation nowcasting: a benchmark and a new model. https://doi.org/10.48550/ARXIV.1706.03458 |
| [41] |
|
| [42] |
|
| [43] |
Wang H, Zhao J, Ye K, et al (2020) A destination prediction model for individual passengers in urban rail transit. In: 2020 international conference on high performance big data and intelligent systems (HPBD&IS). IEEE, https://doi.org/10.1109/hpbdis49115.2020.9130592 |
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
The Author(s)
/
| 〈 |
|
〉 |