Integrating NLP and Ontology Matching into a Unified System for Automated Information Extraction from Geological Hazard Reports
Qinjun Qiu, Zhen Huang, Dexin Xu, Kai Ma, Liufeng Tao, Run Wang, Jianguo Chen, Zhong Xie, Yongsheng Pan
Journal of Earth Science, 2023, Vol. 34, Issue (5): 1433-1446.
Geological hazard reports contain detailed data on past geological hazard events, much of which remains underused. Advances in geographic information retrieval and temporal information retrieval make it possible to analyse these data, mine the spatiotemporal evolution of geological disasters, and support risk decision making. This study presents an information extraction framework that combines NLP and ontology matching to automatically recognize semantic and spatiotemporal information in geological hazard reports. The framework extracts information from unstructured geological disaster reports using named entity recognition, ontology matching and gazetteer matching to identify and annotate elements, so that users can quickly obtain key information and grasp the general content of a report. In addition, we visualize the experimental results and analyse the visualizations. Semantic information related to the dynamics of geohazard events is extracted and retrieved from both natural and human perspectives to provide information on how events progress.
geological hazard report / spatiotemporal information / geological hazard ontology / natural language processing / gazetteers / ontology / machine learning
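
The annotation idea summarized in the abstract (gazetteer matching for places, ontology matching for hazard concepts, plus temporal expressions) can be illustrated with a minimal sketch. It is not the authors' implementation: the tiny `GAZETTEER` and `ONTOLOGY` resources, the class labels, the ISO-style date pattern, and the example sentence are all assumptions made up for illustration, and simple dictionary lookup stands in for the trained named entity recognition model that the framework uses.

```python
import re
from dataclasses import dataclass

# Hypothetical toy resources (assumptions, not from the paper); a real system
# would load a full gazetteer and a geological hazard ontology.
GAZETTEER = {"Zigui County", "Yangtze River"}
ONTOLOGY = {
    "landslide": "HazardType",
    "debris flow": "HazardType",
    "rainfall": "TriggerFactor",
}
# Assumed date format for the sketch: ISO-style YYYY-MM-DD only.
DATE_PATTERN = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")


@dataclass(frozen=True)
class Annotation:
    start: int
    end: int
    text: str
    label: str


def match_terms(text, terms, label_of):
    """Find whole-word, case-insensitive occurrences of each term in text."""
    found = []
    for term in terms:
        pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        for m in pattern.finditer(text):
            found.append(Annotation(m.start(), m.end(), m.group(), label_of(term)))
    return found


def annotate(text):
    """Annotate places, ontology concepts and dates, ordered by position."""
    annotations = match_terms(text, GAZETTEER, lambda term: "Place")
    annotations += match_terms(text, ONTOLOGY, lambda term: ONTOLOGY[term])
    annotations += [
        Annotation(m.start(), m.end(), m.group(), "Date")
        for m in DATE_PATTERN.finditer(text)
    ]
    return sorted(annotations, key=lambda a: a.start)


if __name__ == "__main__":
    # Invented example sentence, not taken from any report.
    report = "On 2019-07-23 a landslide triggered by heavy rainfall hit Zigui County."
    for a in annotate(report):
        print(f"{a.start:>3}-{a.end:<3} {a.label:<14} {a.text}")
```

Exact string lookup keeps the sketch short but does not handle ambiguous place names, inflected terms, or free-form temporal expressions, which is where a trained recognizer and richer ontology matching would come in.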