Attention based simplified deep residual network for citywide crowd flows prediction

Genan DAI, Xiaoyang HU, Youming GE, Zhiqing NING, Yubao LIU

PDF(969 KB)
PDF(969 KB)
Front. Comput. Sci. ›› 2021, Vol. 15 ›› Issue (2) : 152317. DOI: 10.1007/s11704-020-9194-x
RESEARCH ARTICLE

Attention based simplified deep residual network for citywide crowd flows prediction

Author information +
History +

Abstract

Crowd flows prediction is an important problem of urban computing whose goal is to predict the number of incoming and outgoing people of regions in the future. In practice, emergency applications often require less training time. However, there is a little work on how to obtain good prediction performance with less training time. In this paper, we propose a simplified deep residual network for our problem. By using the simplified deep residual network, we can obtain not only less training time but also competitive prediction performance compared with the existing similar method. Moreover, we adopt the spatio-temporal attention mechanism to further improve the simplified deep residual network with reasonable additional time cost. Based on the real datasets, we construct a series of experiments compared with the existing methods. The experimental results confirm the efficiency of our proposed methods.

Keywords

crowd flows prediction / spatio-temporal data mining / attention

Cite this article

Download citation ▾
Genan DAI, Xiaoyang HU, Youming GE, Zhiqing NING, Yubao LIU. Attention based simplified deep residual network for citywide crowd flows prediction. Front. Comput. Sci., 2021, 15(2): 152317 https://doi.org/10.1007/s11704-020-9194-x

References

[1]
Zheng Y, Capra L, Wolfson O, Yang H. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology, 2014, 5(3): 38
CrossRef Google scholar
[2]
Zhang J, Zheng Y, Qi D. Deep spatio-temporal residual networks for citywide crowd flows prediction. In: Proceedings of AAAI Conference on Artificial Intelligence. 2017, 1655–1661
[3]
Wang L, Geng X, Ma X, Liu F, Yang Q. Crowd flow prediction by deep spatio-temporal transfer learning. 2018, arXiv preprint arXiv:1802.00386
[4]
Wu C, Yin T, Ge S, Yu K. Ensemble learning for crowd flows prediction on campus. In: Proceedings of International Conference on Smart Computing and Communication. 2017, 103–113
CrossRef Google scholar
[5]
Zhang J, Zheng Y, Qi D, Li R, Yi X. DNN-based prediction model for spatio-temporal data. In: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 2016
CrossRef Google scholar
[6]
Song X, Zhang Q, Sekimoto Y, Shibasaki R. Prediction of human emergency behavior and their mobility following large-scale disaster. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2014, 5–14
CrossRef Google scholar
[7]
Fan Z, Song X, Shibasaki R, Adachi R. Citymomentum: an online approach for crowd behavior prediction at a citywide level. In: Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 2015, 559–569
CrossRef Google scholar
[8]
Silva R, Kang S M, Airoldi E M. Predicting traffic volumes and estimating the effects of shocks in massive transportation systems. Proceedings of the National Academy of Sciences, 2015, 112(18): 5643–5648
CrossRef Google scholar
[9]
Xu Y, Kong Q J, Klette R, Liu Y. Accurate and interpretable bayesian mars for traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems, 2014, 15(6): 2457–2469
CrossRef Google scholar
[10]
Bao J, He T, Ruan S, Li Y, Zheng Y. Planning bike lanes based on sharingbikes’ trajectories. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2017, 1377–1386
CrossRef Google scholar
[11]
Li Y, Zheng Y, Zhang H, Chen L. Traffic prediction in a bike-sharing system. In: Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. 2015
CrossRef Google scholar
[12]
Kong X, Xu Z, Shen G, Wang J, Yang Q, Zhang B. Urban traffic congestion estimation and prediction based on floating car trajectory data. Future Generation Computer Systems, 2016, 61: 97–107
CrossRef Google scholar
[13]
Zheng Y, Yi X, Li M, Li R, Shan Z, Chang E, Li T. Forecasting finegrained air quality based on big data. In: Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015, 2267–2276
CrossRef Google scholar
[14]
Hamed M M, Al-Masaeid H R, Said Z M B. Short-term prediction of traffic volume in urban arterials. Journal of Transportation Engineering, 1995, 121(3): 249–254
CrossRef Google scholar
[15]
Ding Q Y, Wang X F, Zhang X Y, Sun Z Q. Forecasting traffic volume with space-time arima model. Advanced Materials Research, 2011, 156: 979–983
CrossRef Google scholar
[16]
Bengio Y, Simard P, Frasconi P. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 1994, 5(2): 157–166
CrossRef Google scholar
[17]
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation, 1997, 9(8): 1735–1780
CrossRef Google scholar
[18]
Yu B, Yin H, Zhu Z. Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting. In: Proceedings of International Joint Conferences on Artificial Intelligence. 2018, 3634–3640
CrossRef Google scholar
[19]
LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278–2324
CrossRef Google scholar
[20]
Shi X, Chen Z, Wang H, Yeung D Y, Wong W K, Woo W C. Convolutional LSTM network: a machine learning approach for precipitation nowcasting. In: Proceedings of Advances in Neural Information Processing Systems. 2015, 802–810
[21]
Xiong F, Shi X, Yeung D Y. Spatiotemporal modeling for crowd counting in videos. In: Proceedings of the IEEE International Conference on Computer Vision. 2017, 5151–5159
CrossRef Google scholar
[22]
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016, 770–778
CrossRef Google scholar
[23]
Chen L, Zhang H, Xiao J, Nie L, Shao J, Liu W, Chua T S. SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 5659–5667
CrossRef Google scholar
[24]
Lu J, Xiong C, Parikh D, Socher R. Knowing when to look: adaptive attention via a visual sentinel for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 375–383
CrossRef Google scholar
[25]
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez A N, Kaiser Ł, Polosukhin I. Attention is all you need. In: Proceedings of Advances in Neural Information Processing Systems. 2017, 5998–6008
[26]
Bahdanau D, Chorowski J, Serdyuk D, Brakel P, Bengio Y. End-to-end attention-based large vocabulary speech recognition. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. 2016, 4945–4949
CrossRef Google scholar
[27]
Chorowski J K, Bahdanau D, Serdyuk D, Cho K, Bengio Y. Attentionbased models for speech recognition. In: Proceedings of Advances in Neural Information Processing Systems. 2015, 577–585
[28]
Zhou X, Shen Y, Zhu Y, Huang L. Predicting multi-step citywide passenger demands using attention-based neural networks. In: Proceedings of the 11th ACM International Conference onWeb Search and Data Mining. 2018, 736–744
CrossRef Google scholar
[29]
Geng X, Li Y, Wang L, Zhang L, Yang Q, Ye J, Liu Y. Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting. In: Proceedings of AAAI Conference on Artificial Intelligence. 2019
CrossRef Google scholar
[30]
Veliˇckovíc P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y. Graph attention networks. In: Proceedings of International Conference on Learning Representations. 2018
[31]
Liang Y, Ke S, Zhang J, Yi X, Zheng Y. Geoman: multi-level attention networks for geo-sensory time series prediction. In: Proceedings of International Joint Conferences on Artificial Intelligence. 2018, 3428–3434
CrossRef Google scholar
[32]
Liu L, Zhang R, Peng J, Li G, Du B, Lin L. Attentive crowd flow machines. In: Proceedings of the 26th ACM International Conference on Multimedia. 2018, 1553–1561
CrossRef Google scholar
[33]
Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks. In: Proceedings of Advances in Neural Information Processing Systems. 2012, 1097–1105
[34]
Hu X, Dai G, Ge Y, Ning Z, Liu Y. A simplified deep residual network for citywide crowd flows prediction. In: Proceedings of the International Conference on Semantics, Knowledge and Grids. 2019, 60–67
CrossRef Google scholar
[35]
Kingma D P, Ba J. Adam: a method for stochastic optimization. 2014, arXiv preprint arXiv:1412.6980
[36]
Wu S, Tang Y, Zhu Y, Wang L, Xie X, Tan T. Session-based recommendation with graph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2019, 346–353
CrossRef Google scholar
[37]
Box G E P, Jenkins G M, Reinsel G C, Ljung G M. Time series analysis: forecasting and control. Journal of the Operational Research Society, 2015, 22(2): 199–201
[38]
Williams B M, Durvasula P K, Brown D E. Urban freeway traffic flow prediction: application of seasonal autoregressive integrated moving average and exponential smoothing models. Transportation Research Record, 1998, 1644(1): 132–141
CrossRef Google scholar
[39]
Lütkepohl H. Vector Autoregressive Models. Cheltenham: Edward Elgar Publishing, 2013
[40]
Hoang M X, Zheng Y, Singh A K. FCCF: forecasting citywide crowd flows based on big data. In: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 2016
CrossRef Google scholar

RIGHTS & PERMISSIONS

2020 Higher Education Press
AI Summary AI Mindmap
PDF(969 KB)

Accesses

Citations

Detail

Sections
Recommended

/