Commuting flow prediction using OpenStreetMap data
Kuldip Singh Atwal , Taylor Anderson , Dieter Pfoser , Andreas Züfle
Computational Urban Science ›› 2025, Vol. 5 ›› Issue (1) : 2
Commuting flow prediction using OpenStreetMap data
Accurately predicting commuting flows is crucial for sustainable urban planning and preventing disease spread due to human mobility. While recent advancements have produced effective models for predicting these recurrent flows, the existing methods rely on datasets exclusive to a few study areas, limiting the transferability to other locations. This research broadens the utility of state-of-the-art commuting flow prediction models with globally available OpenStreetMap data while achieving prediction accuracy comparable to location-specific and proprietary data. We show that the types of buildings, residential and non-residential, are a strong indicator for predicting commuting flows. Consistent with theoretical and analytical models, our experiments indicate that building types, distance, and population are the determining characteristics for mobility related to commuting. Our experiments show that predicted flows closely match ground truth flows. Our work enables accurate flow prediction using building types to support applications such as urban planning and epidemiology.
| [1] |
|
| [2] |
Alonso, W. (1971). The system of intermetropolitan population flows. Institute of Urban and Regional Development. |
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
Bast, H., Storandt, S., & Weidner, S. (2015). Fine-grained population estimation. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems (pp. 1–10). New York: Association for Computing Machinery. |
| [7] |
|
| [8] |
Cai, M., Pang, Y., & Sekimoto, Y. (2022). Spatial attention based grid representation learning for predicting origin–destination flow. In 2022 IEEE International Conference on Big Data (Big Data) (pp. 485–494). IEEE. |
| [9] |
Census Bureau, U. (2010). Population data. https://data.census.gov/table. Accessed 3 June 2024. |
| [10] |
Census Bureau, U. (2015). Lodes data. https://lehd.ces.census.gov/data/. Accessed 3 June 2024. |
| [11] |
Chen, T. & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 785–794). New York: Association for Computing Machinery. |
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
F de Arruda, H., Reia, S.M., Ruan, S., Atwal, K.S., Kavak, H., Anderson, T., & Pfoser, D. (2024). An openstreetmap derived building classification dataset for the united states. Scientific Data, 11(1), 1210. |
| [16] |
Fairfax County, G. (2024). Fairfax county open geospatial data. https://www.fairfaxcounty.gov/maps/open-geospatial-data. Accessed 3 June 2024. |
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Koca, D., Schmöcker, J. D., & Fukuda, K. (2021). Origin-destination matrix estimation by deep learning using maps with new york case study. In 2021 7th International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS) (pp. 1–6). IEEE. |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
Lenormand, M., Huet, S., Gargiulo, F., & Deffuant, G. (2012). A universal model of commuting networks. PLoS ONE, 7(10), 1–7. |
| [30] |
|
| [31] |
Li, M.-H., Chen, B.-Y., & Li, C.-T. (2022). A hybird method with gravity model and nearest-neighbor search for trip destination prediction in new metropolitan areas. In 2022 IEEE International Conference on Big Data (Big Data) (pp. 6553–6560). IEEE. |
| [32] |
Liang, Y., Ouyang, K., Sun, J., Wang, Y., Zhang, J., Zheng, Y., Rosenblum, D., & Zimmermann, R. (2021). Fine-grained urban flow prediction. In Proceedings of the Web Conference 2021 (pp. 1833–1845). New York: Association for Computing Machinery. |
| [33] |
|
| [34] |
Liu, Z., Miranda, F., Xiong, W., Yang, J., Wang, Q., & Silva, C. (2020). Learning geo-contextual embeddings for commuting flow prediction. In Proceedings of the AAAI conference on artificial intelligence (vol. 34, pp. 808–816). Washington: AAAI Press. |
| [35] |
Lundberg, S. M. & Lee, S.-I. (2017). A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17 (pp. 4768–4777). Curran Associates Inc. |
| [36] |
Luxen, D. & Vetter, C. (2011). Real-time routing with openstreetmap data. In Proceedings of the 19th ACM SIGSPATIAL international conference on advances in geographic information systems (pp. 513–516). New York: Association for Computing Machinery. |
| [37] |
Masucci, A. P., Serras, J., Johansson, A., & Batty, M. (2013). Gravity versus radiation models: On the importance of scale and heterogeneity in commuting flows. Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 88(2), 022812. |
| [38] |
Morton, A., Piburn, J., & Nagle, N. (2018). Need a boost? a comparison of traditional commuting models with the xgboost model for predicting commuting flows (short paper). In 10th International Conference on Geographic Information Science (GIScience 2018). Schloss-Dagstuhl-Leibniz Zentrum für Informatik. |
| [39] |
NYC, D. o. C. P. (2015). Pluto and mappluto. https://www.nyc.gov/site/planning/data-maps/open-data/dwn-pluto-mappluto.page. Accessed 3 June 2024. |
| [40] |
N.Y.C, O. (2024). Nyc opendata. https://data.cityofnewyork.us/Housing-Development/Building-Footprints/nqwf-w8eh. Accessed 3 June 2024. |
| [41] |
OSM, c. (2024). Openstreetmap. https://www.openstreetmap.org. Accessed 3 June 2024. |
| [42] |
|
| [43] |
|
| [44] |
Robinson, C. & Dilkina, B. (2018). A machine learning approach to modeling human migration. In Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies (pp. 1–8). New York: Association for Computing Machinery. |
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
Spadon, G., Carvalho, A.C.d., Rodrigues-Jr, J.F., & Alves, L. G. (2019). Reconstructing commuters network using machine learning and urban indicators. Scientific reports, 9(1), 11801. |
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
Zeng, J., Zhang, G., Rong, C., Ding, J., Yuan, J., & Li, Y. (2022). Causal learning empowered od prediction for urban planning. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management (pp. 2455–2464). New York: Association for Computing Machinery. |
| [58] |
Zhang, J., Zheng, Y., & Qi, D. (2017). Deep spatio-temporal residual networks for citywide crowd flows prediction. In Proceedings of the AAAI conference on artificial intelligence (vol. 31). Palo Alto: AAAI Press. |
| [59] |
|
| [60] |
|
| [61] |
|
National Science Foundation(2109647)
The Author(s)
/
| 〈 |
|
〉 |