Synthetic population and urban mobility modeling using open and publicly available data - a Washington, DC COVID case study
Shiyang Ruan , Tunaggina Khan , Mengfei Xin , Andreas Züfle , Dieter Pfoser
Computational Urban Science ›› 2025, Vol. 5 ›› Issue (1) : 57
Synthetic population and urban mobility modeling using open and publicly available data - a Washington, DC COVID case study
Human behavior changes especially when facilitated and amplified by policy changes during situations such as the COVID pandemic can have far reaching consequences for urban mobility. We have seen orphaned roads and empty metro cars during times when “rush hour” should have been the norm. To better comprehend these phenomena, this work uses Agent-based modeling (ABM), specifically the MATSim framework, in combination with a wealth of publicly available data (CENSUS and Openstreetmap - OSM) to model pre- and during COVID urban mobility for the Washington, DC metro area. The available CENSUS data combined with population generation algorithms and MATSim allows us to model a population of four million people and their daily mobility patterns on a multimodal transportation network that includes a road network, the metro system, and a number of bus services in the Washington, DC metro area. In comparing the simulation output with ground-truth flows, we show that indeed our approach is capable to accurately capturing traffic flows in this multimodal network as observed before and during the COVID pandemic. Overall, this work demonstrates that publicly available population data (CENSUS) and transportation infrastructure and POI data (OSM) can be leveraged in a powerful simulation framework to accurately model urban mobility. Example scenarios could be the evaluation of future policy proposals as well as infrastructure projects that leverage mobility patterns.
Digital twin / Urban mobility / Microsimulation / Agent-based modeling
| [1] |
|
| [2] |
|
| [3] |
Balac, M., & Hörl, S. (2021). Synthetic population for the state of California based on open-data: examples of San Francisco Bay area and San Diego County. 100th Annual Meeting of the Transportation Research Board (TRB), Jan 2021, Washington, D.C. (virtual), United States. https://hal.science/hal-03208848/. |
| [4] |
Bhandari, P., Anastasopoulos, A., & Pfoser, D. (2024). Urban mobility assessment using llms. In Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, SIGSPATIAL ’24 (pp. 67–79). Association for Computing Machinery. |
| [5] |
|
| [6] |
Castiglione, J., Bradley, M., & Gliebe, J. (2015). Activity-based travel demand models: a primer (No. SHRP 2 Report S2-C46-RR-1). |
| [7] |
Census Bureau, U.S. Public use microdata areas (pumas). https://www.census.gov/programs-surveys/geography/guidance/geo-areas/pumas.html. Accessed 6 June 2022. |
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
de Dios Ortúzar, J. & Willumsen, L. G. (2011). Modelling transport. John wiley & sons. |
| [12] |
|
| [13] |
Federal Highway Administration, Department of Transpoation. National household travel survey (rts). https://https://nhts.ornl.gov/. Accessed 6 June 2024. |
| [14] |
Gkountouna, O., Pfoser, D., & Züfle, A. (2020). Traffic flow estimation using probe vehicle data. In 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA) (pp. 579–588). IEEE. |
| [15] |
|
| [16] |
Gurcan, O. Llm-augmented agent-based modelling for social simulations: Challenges and opportunities (2024). arXiv:2405.06700. |
| [17] |
Harris, M. A., & Branion-Calles, M. (2021). Changes in commute mode attributed to COVID-19 risk in Canadian national survey data. Findings (Sydney (N.S.W.). https://doi.org/10.32866/001c.19088. |
| [18] |
|
| [19] |
Horni, A., Nagel, K., & Axhausen, K. (2016). The Multi-Agent Transport Simulation MATSim. Ubiquity Press. |
| [20] |
Hsu, S.-L., Tung, E., Krumm, J., Shahabi, C., & Shafique, K. (2024). Trajgpt: Controlled synthetic trajectory generation using a multitask transformer-based spatiotemporal model. In Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, SIGSPATIAL ’24 (pp. 362–371). Association for Computing Machinery. |
| [21] |
|
| [22] |
Javadinasr, M., Magassy, T. B., Rahimi, E., Davatgari, A., Salon, D., Bhagat-Conway, M. W., Chauhan, R. S., Pendyala, R. M., Derrible, S., Khoeini, S., et al. (2021). The enduring effects of covid-19 on travel behavior in the united states: A panel study on observed and expected changes in telecommuting, mode choice, online shopping and air travel. arXiv preprint arXiv:2109.07988. |
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
Klein, B., LaRock, T., McCabe, S., Torres, L., Friedland, L., Privitera, F., Lake, B., Kraemer, M. U., Brownstein, J. S., Lazer, D., et al. (2020). Reshaping a nation: Mobility, commuting, and contact patterns during the covid-19 outbreak. Northeastern University-Network Science Institute Report. |
| [28] |
|
| [29] |
Lim, P. P. (2020). Population synthesis for travel demand modelling in Australian capital cities. PhD dissertation, The University of Queensland. |
| [30] |
|
| [31] |
Lopez, P. A., Behrisch, M., Bieker-Walz, L., Erdmann, J., Flötteröd, Y.-P., Hilbrich, R., Lücken, L., Rummel, J., Wagner, P., & Wießner, E. (2018). Microscopic traffic simulation using sumo. In 2018 21st international conference on intelligent transportation systems (ITSC) (pp. 2575–2582). IEEE. |
| [32] |
|
| [33] |
|
| [34] |
Maryland State Department of Education (2020). State superintendent salmon announces temporary closure of maryland public schools. maryland.gov. |
| [35] |
Mbuya, J. K., Pfoser, D., & Anastasopoulos, A. (2024). Trajectory anomaly detection with language models. In Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems, SIGSPATIAL ’24 (pp. 208–219). Association for Computing Machinery. |
| [36] |
Meckler, L. (2020). Seven states, d.c. order all schools closed in effort to prevent spread of covid-19. The Washington Post. |
| [37] |
Mobility Analytics Research Group (MARG). Popgen: Synthetic population generator. http://www.mobilityanalytics.org/popgen.html. Accessed 6 June 2022. |
| [38] |
|
| [39] |
Natanson, H. (2020). Virginia closes schools for the year, sending districts and families scrambling. The Washington Post. |
| [40] |
Ozaydin, O., & Ulengin, F. (2020). Impacts of COVID-19 on the Transport Sector and Measures as Well as Recommendations of Policies and Future Research: A Report on Turkey. Available at SSRN: http://dx.doi.org/10.2139/ssrn.3686628. |
| [41] |
|
| [42] |
|
| [43] |
RECKER, J. (2020). Dc public schools are closed for the rest of the school year. Washingtonian. |
| [44] |
|
| [45] |
SafeGraph. Covid19-commerce-pattern. https://www.safegraph.com/data-examples/covid19-commerce-pattern. Accessed 6 June 2022. |
| [46] |
SafeGraph. Store visit attribution: Importance, methods, & where to get data. https://www.safegraph.com/guides/visit-attribution. Accessed 21 June 2022. |
| [47] |
Sallard, A., Balać, M., & Hörl, S. (2020). A synthetic population for the greater São Paulo metropolitan region. ETH Zurich. https://doi.org/10.3929/ETHZ-B-000429951. |
| [48] |
|
| [49] |
|
| [50] |
Snowdon, J., Gkountouna, O., Züfle, A., & Pfoser, D. (2018). Spatiotemporal traffic volume estimation model based on gps samples. In Proceedings of the Fifth International ACM SIGMOD Workshop on Managing and Mining Enriched Geo-Spatial Data (pp. 1–6). |
| [51] |
Solomou, S., & Sengupta, U. (2024). Simulating complex urban behaviours with ai: Incorporating improved intelligent agents in urban simulation models. Urban Planning, 10. |
| [52] |
Müller, J.; Straub, M.; Richter, G.; Rudloff, C. (2022). Integration of Different Mobility Behaviors and Intermodal Trips in MATSim. Sustainability, 14, 428. https://doi.org/10.3390/su14010428. |
| [53] |
|
| [54] |
United States Office of Management and Budget (OMB). North american industry classification system - naics. https://www.census.gov/naics/. Accessed 21 June 2022. |
| [55] |
U.S. Census Bureau. American community survey, 2019 american community survey 5-year estimates, table s0101. https://data.census.gov/cedsci/table?q=ACSST5Y2019.S0101. Accessed 6 June 2022. |
| [56] |
U.S. Census Bureau, Center for Economic Studies. Lehd origin-destination employment statistics (lodes). US Census Bureau Center for Economic Studies Publications and reports page,https://lehd.ces.census.gov/data/. Accessed 6 June 2022. |
| [57] |
|
| [58] |
Wang, J., Xiong, J., Yang, K., Peng, S., & Xu, Q. (2010). Use of GIS and agent-based modeling to simulate the spread of influenza. In 2010 18th International Conference on Geoinformatics (pp. 1–6). |
| [59] |
|
| [60] |
Washington Metropolitan Area Transit Authority (WMATA). Rail ridership data viewer. https://www.wmata.com/initiatives/ridership-portal/Metrorail-Ridership-Summary.cfm. Accessed 6 Mar 2025. |
| [61] |
Washington Metropolitan Area Transit Authority (WMATA). Transit feed. https://transitfeeds.com/feeds. Accessed 6 June 2022. |
| [62] |
|
| [63] |
|
| [64] |
|
| [65] |
Zhang, L., Pfoser, D., & Züfle, A. (2020). Station-to-user transfer learning: Towards explainable user clustering through latent trip signatures using tidal-regularized non-negative matrix factorization. In Proceedings of the 28th International Conference on Advances in Geographic Information Systems (pp. 303–313). |
| [66] |
|
The Author(s)
/
| 〈 |
|
〉 |