Extracting Natech Reports from Large Databases: Development of a Semi-Intelligent Natech Identification Framework
Xiaolong Luo , Ana Maria Cruz , Dimitrios Tzioutzios
International Journal of Disaster Risk Science ›› 2020, Vol. 11 ›› Issue (6) : 735 -750.
Natural hazard-triggered technological accidents (Natechs) refer to accidents involving releases of hazardous materials (hazmat) triggered by natural hazards. Huge economic losses, as well as human health and environmental problems are caused by Natechs. In this regard, learning from previous Natechs is critical for risk management. However, due to data scarcity and high uncertainty concerning such hazards, it becomes a serious challenge for risk managers to detect Natechs from large databases, such as the National Response Center (NRC) database. As the largest database of hazmat release incidents, the NRC database receives hazmat release reports from citizens in the United States. However, callers often have incomplete details about the incidents they are reporting. This results in many records having incomplete information. Consequently, it is quite difficult to identify and extract Natechs accurately and efficiently. In this study, we introduce machine learning theory into the Natech retrieving research, and a Semi-Intelligent Natech Identification Framework (SINIF) is proposed in order to solve the problem. We tested the suitability of two supervised machine learning algorithms, namely the Long Short-Term Memory (LSTM) and the Convolutional Neural Network (CNN), and selected the former for the development of the SINIF. According to the results, the SINIF is efficient (a total number of 826,078 records were analyzed) and accurate (the accuracy is over 0.90), while 32,841 Natech reports between 1990 and 2017 were extracted from the NRC database. Furthermore, the majority of those Natech reports (97.85%) were related to meteorological phenomena, with hurricanes (24.41%), heavy rains (19.27%), and storms (18.29%) as the main causes of these reported Natechs. Overall, this study suggests that risk managers can benefit immensely from SINIF in analyzing Natech data from large databases efficiently.
Data extraction method / Machine learning / Natechs / Natural hazards / NRC database
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
Bashar, S.S., and A.A. Mahmud. 2019. A machine learning approach for heart rate estimation from PPG signal using random forest regression algorithm. In Proceedings of 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), 24–25 July 2019, Swat, Pakistan, 1–5. |
| [5] |
|
| [6] |
|
| [7] |
Bureau for Analysis of Industrial Risk and Pollution. 2019. Analysis, Research and Information on Accidents (ARIA). French Ministry of Ecology and Sustainable Development, Bureau for Analysis of Industrial Risk and Pollution, France. https://www.aria.developpement-durable.gouv.fr. Accessed 10 Jan 2019 (in French). |
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
Cireşan, D., U. Meier, and J. Schmidhuber. 2012. Multi-column deep neural networks for image classification. arXiv preprint arXiv:1202.2745. |
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
European Commission. 2019a. Natech accident database. Joint Research Centre, Institute for the Protection and Security of the Citizen, Italy. http://enatech.jrc.ec.europa.eu. Accessed 11 Jan 2019. |
| [20] |
European Commission. 2019b. eMars (Major Accident Reporting System) database. European Commission, Joint Research Centre, Institute for the Protection and Security of the Citizen, Italy. http://emars.jrc.ec.europa.eu. Accessed 11 Jan 2019. |
| [21] |
Fernández, S., A. Graves, and J. Schmidhuber. 2007. An application of recurrent neural networks to discriminative keyword spotting. In Proceedings of International Conference on Artificial Neural Networks, 9–13 September 2007, Porto, Portugal, 220–229. |
| [22] |
|
| [23] |
Girgin, S., and E. Krausmann. 2014. Analysis of pipeline accidents induced by natural hazards: Final report. JRC88410. Joint Research Centre, European Commission, Italy. |
| [24] |
|
| [25] |
Graves, A., and J. Schmidhuber. 2009. Offline handwriting recognition with multidimensional recurrent neural networks. In Proceedings of the 21st International Conference on Neural Information Processing Systems, ed. D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, 545–552. Red Hook, NY: Curran Associates Inc. |
| [26] |
|
| [27] |
Huang, G.-B., H. Zhou, X. Ding, and R. Zhang. 2011. Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 42(2): 513–529. |
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
Le, Q.V., and T. Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning, 21–26 June 2014, Beijing, China. JMLR:W&CP 32: 1–5. |
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
Özgür, A., L. Özgür, and T. Güngör. 2005. Text categorization with class-based and corpus-based keyword selection. In Proceedings of 20th International Symposium on Computer and Information Sciences, 26–28 October 2005, Istanbul, Turkey, 606–615. |
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
Shin, J., Y. Kim, S. Yoon, and K. Jung. 2018a. Contextual-CNN: A novel architecture capturing unified meaning for sentence classification. In Proceedings of 2018 IEEE International Conference on Big Data and Smart Computing, 15–18 January 2018, Shanghai, China, 491–494. |
| [52] |
Shih, C.H., B.C. Yan, S.H. Liu, and B. Chen. 2018b. Investigating Siamese LSTM networks for text categorization. In Proceedings of the 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 12–15 November 2018, Honolulu, Hawaii, USA, 641–646. |
| [53] |
|
| [54] |
|
| [55] |
Sotthisopha, N., and P. Vateekul. 2018. Improving short text classification using fast semantic expansion on multichannel convolutional neural network. In Proceedings of the 19th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 27–29 June 2018, Busan, South Korea, 182–187. |
| [56] |
|
| [57] |
TNO Industrial and External Safety. 2019. Failure and ACcidents Technical information System (FACTS). the Unified Industrial & Harbour Fire Department in Rotterdam-Rozenburg, the Netherlands. http://www.factsonline.nl/. Accessed 14 Jan 2019. |
| [58] |
|
| [59] |
United States Coast Guard. 2017. United States National Response Center (NRC) database. Washington, DC: United States Coast Guard. http://www.nrc.uscg.mil/. Accessed 17 Oct 2017. |
/
| 〈 |
|
〉 |