Machine learning-based hydrological models for flash floods: a systematic literature review
Leonardo B. L. Santos , Elton V. Escobar-Silva , Luiz F. Satolo , Ricardo S. Oyarzabal , Michael M. Diniz , Rogério G. Negri , Glauston R. T. Lima , Stephan Stephany , Jaqueline A. J. P. Soares , Johan S. Duque , Fernando L. Saraiva Filho , Angélica N. Caseri , Luiz Bacelar
Smart Construction and Sustainable Cities ›› 2025, Vol. 3 ›› Issue (1) : 21
Machine learning-based hydrological models for flash floods: a systematic literature review
Flash floods are critical events for emergency management, yet their modeling remains highly challenging, even in smart cities approaches. Physically based hydrological models are often unsuitable at small spatiotemporal scales due to their computational complexity and dependence on detailed local parameters, which are rarely available during flash floods. With the growing availability of hydrological data, machine learning (ML) has emerged as a promising alternative. This work performs a Systematic Literature Review (SLR) to improve our understanding of the research landscape on ML applications for flash flood forecasting, a significant subset of flash flood modeling. From more than 1,200 papers published until January 2024 in Web of Science, SCOPUS/Elsevier, Springer/Nature, and Wiley, 50 were selected following PRISMA guidelines. The inclusion and exclusion criteria removed reviews, retractions, papers focused on post-flood damage assessment (not forecasting), and those with time resolutions of 6 hours or more, retaining only studies with fine-scale temporal data (<6 hours). For each paper, we extracted information on forecasting horizon, study area size, input data, ML techniques, and outcomes (regression or classification). Results show a sharp rise in ML-based flash flood research, with China leading (38%). Nearly all studies rely on rainfall, discharge, and water level data - often in combination. Long short-term memory (LSTM) networks dominate (60%). Unfortunately, only 10% of the selected studies provide access to their datasets. This lack of transparency poses a major barrier to reproducibility, inhibits fair comparative evaluation of models, and ultimately slows methodological progress in flash flood forecasting. Furthermore, our review highlights that no method consistently outperforms others. This variability in performance is likely influenced by factors such as regional hydrological characteristics (e.g., differences between arid and tropical basins), variations in input data quality, and the length of the forecast horizon (e.g., 1- vs. 6-hour prediction). Lastly, we recommend advancing this field through integration with early warning systems, creation of benchmarks, open data practices, and stronger multidisciplinary collaboration.
Artificial intelligence / Machine learning / Flash floods / Hydrological modeling / Disasters
| [1] |
WMO: (2021) WMO Atlas of Mortality and Economic Losses from Weather, Climate and Water Extremes (1970-2019). WMO-No. 1267, 90p., ISBN:978-92-63-11267-5, WMO, Geneva. https://library.wmo.int/idurl/4/57564481 |
| [2] |
EM-DAT U CRED (2024) The international disaster datase. https://public.emdat.be/data. Accessed 2 Sep 2024 |
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
Lyu H, Yin Z, Shen S, Chen X, Su D (2025) A novel framework incorporating machine learning into gis for flood susceptibility prediction of urban metro systems. Sci China Technol Sci 68(6):1–17 |
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
Lange H, Sippel S (2020) Machine learning applications in hydrology. Forest-water interactions. Springer, Cham. 240. pp 233–257 |
| [28] |
Mosaffa H, Sadeghi M, Mallakpour I, Jahromi MN, Pourghasemi HR (2022) Application of machine learning algorithms in hydrology. In: Computers in earth and environmental sciences. Elsevier, pp 585–591 |
| [29] |
|
| [30] |
|
| [31] |
Ardabili S, Mosavi A, Dehghani M, Várkonyi-Kóczy AR (2020) Deep learning and machine learning in hydrological processes climate change and earth systems a systematic review. In: Engineering for Sustainable Future: Selected papers of the 18th International Conference on Global Research and Education Inter-Academia–2019 18. Springer, pp 52–62 |
| [32] |
|
| [33] |
Page MJ, Moher D, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, et al (2021) Prisma 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ 372:1–36 |
| [34] |
Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, et al (2021) The prisma 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372:1–9 |
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
Whittle P (1951) Hypothesis testing in time series analysis. Uppsala: Almquist Wiksell, 121p., Doctorate thesis, University of Uppsala |
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
Prokhorenkova L, Gusev G, Vorobev A, Dorogush AV, Gulin A (2018) CatBoost: unbiased boosting with categorical features. Adv Neural Inf Process Syst 31:1–11 |
| [61] |
|
| [62] |
|
| [63] |
|
| [64] |
|
| [65] |
|
| [66] |
|
| [67] |
|
| [68] |
Shi X, Chen Z, Wang H, Yeung DY, Wong WK, Woo Wc (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. Adv Neural Inf Process Syst 28:1–9 |
| [69] |
Sabour S, Frosst N, Hinton GE (2017) Dynamic routing between capsules. Adv Neural Inf Process Syst 30:1–11 |
| [70] |
|
| [71] |
|
| [72] |
|
| [73] |
|
| [74] |
|
| [75] |
|
| [76] |
|
| [77] |
|
| [78] |
Wang H, Xu S, Xu H, Wu Z, Wang T, Ma C (2023) Rapid prediction of urban flood based on disaster-breeding environment clustering and bayesian optimized deep learning model in the coastal city. Sustain Cities Soc 99(104):898. https://doi.org/10.1016/j.scs.2023.104898. Publisher: Elsevier Ltd |
| [79] |
|
| [80] |
|
| [81] |
Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2020) Generative adversarial networks. ACM New York, NY, |
| [82] |
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 |
| [83] |
|
| [84] |
|
| [85] |
|
| [86] |
Fix E (1985) Discriminatory analysis: nonparametric discrimination, consistency properties, vol 1. USAF School of Aviation Medicine |
| [87] |
Graves A (2012) Long short-term memory. In: Supervised Sequence Labelling with Recurrent Neural Networks. Studies in Computational Intelligence. Springer, Berlin, Heidelberg. vol 385. |
| [88] |
|
| [89] |
|
| [90] |
Ho HV, Nguyen DH, Le XH, Lee G (2022) Multi-step-ahead water level forecasting for operating sluice gates in hai duong. Vietnam Environ Monit Assess 194(6):442. https://doi.org/10.1007/s10661-022-10115-7. Place: Cham Publisher: Cham: Springer International Publishing |
| [91] |
|
| [92] |
Cui Z, Guo S, Zhou Y, Wang J (2023) Exploration of dual-attention mechanism-based deep learning for multi-step-ahead flood probabilistic forecasting. Journal of hydrology (Amsterdam) 622(129):688. https://doi.org/10.1016/j.jhydrol.2023.129688. Publisher: Elsevier B.V |
| [93] |
|
| [94] |
|
| [95] |
|
| [96] |
|
| [97] |
|
| [98] |
|
| [99] |
|
| [100] |
|
| [101] |
|
| [102] |
|
| [103] |
|
| [104] |
|
| [105] |
|
| [106] |
|
| [107] |
|
| [108] |
|
| [109] |
|
| [110] |
|
| [111] |
|
| [112] |
|
| [113] |
|
| [114] |
|
| [115] |
|
| [116] |
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30:1–11 |
| [117] |
|
| [118] |
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3(Mar):1157–1182 |
| [119] |
|
| [120] |
|
| [121] |
Moges E, Demissie Y, Larsen L, Yassin F (2021) Review: sources of hydrological model uncertainties and advances in their analysis. Water 13(1). https://doi.org/10.3390/w13010028 |
| [122] |
|
| [123] |
Liu Z, Felton T, Mostafavi A (2023) Identifying flash flood hotspots with explainable machine learning using urban features. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI, , UrbanAI ’23. Association for Computing Machinery, New York, pp 6–9. https://doi.org/10.1145/3615900.3628792 |
| [124] |
|
| [125] |
|
| [126] |
|
| [127] |
|
| [128] |
|
| [129] |
|
| [130] |
Prakash C, Barthwal A, Acharya D (2023) Retracted article: Floodalert: an internet of things based real-time flash flood tracking and prediction system. Multimedia Tools and Applications 82(28):43701–43727 |
| [131] |
|
The Author(s)
/
| 〈 |
|
〉 |