Biological Classification System Knowledge Graph and Semi-automatic Construction of Its Invertebrate Fossil Branches

Shaochun Dong, Yukun Shi, Yizao Ran, Haijun Wu, Yiying Deng, Junxuan Fan, Xinyu Dai

Journal of Earth Science ›› 2024, Vol. 35 ›› Issue (6) : 2119-2128.

Journal of Earth Science ›› 2024, Vol. 35 ›› Issue (6) : 2119-2128. DOI: 10.1007/s12583-023-1941-y
Geoscience Big Data

Biological Classification System Knowledge Graph and Semi-automatic Construction of Its Invertebrate Fossil Branches

Author information +
History +

Abstract

Biological classification is the foundation of biology and paleontology, as it arranges all the organisms in a hierarchy that humans can easily follow and understand. It is further used to reconstruct the evolution of life. A biological classification system (BCS) that includes all the established fossil taxa would be both useful and challenging for uncovering the life history. Since fossil taxa were originally recorded in various published books and articles written by natural languages, the primary step is to organize all those taxa information in a manner that can be deciphered by a computer system. A Knowledge Graph (KG) is a formalized description framework of semantic knowledge, which represents and retrieves knowledge in a machine-understandable way, and therefore provides an eligible method to represent the BCS. In this paper, a model of the BCS KG including the ontology and fact layers is presented. To put it into practice, the ontology layer of the invertebrate fossil branches was manually developed, while the fact layer was automatically constructed by extracting information from 46 volumes of the Treatise of Invertebrate Paleontology series with the help of natural language processing technology. As a result, 27 348 taxa nodes spanning fourteen taxonomic ranks were extracted with high accuracy and high efficiency, and the invertebrate fossil branches of the BCS KG was thus installed. This study demonstrates that a properly designed KG model and its automatic construction with the help of natural language processing are reliable and efficient.

Cite this article

Download citation ▾
Shaochun Dong, Yukun Shi, Yizao Ran, Haijun Wu, Yiying Deng, Junxuan Fan, Xinyu Dai. Biological Classification System Knowledge Graph and Semi-automatic Construction of Its Invertebrate Fossil Branches. Journal of Earth Science, 2024, 35(6): 2119‒2128 https://doi.org/10.1007/s12583-023-1941-y

References

Adam-BlondonA F, AlauxM, PommierC, et al. . Towards an Open Grapevine Information System. Horticulture Research, 2016, 3: 16056
CrossRef Google scholar
Aristotle. The Complete Works of Aristotle. Vols. I and II, 1995 Princeton, NJ Princeton University Press
ArpR, SmithB, SpearA D. Building Ontologies with Basic Formal Ontology, 2015 1-25
AyadiA, SametA, de Bertrand de BeuvronF, et al. . Ontology Population with Deep Learning-Based NLP: A Case Study on the Biomolecular Network Ontology. Procedia Computer Science, 2019, 159: 572-581
CrossRef Google scholar
AyadiA, Zanni-MerkC, de Bertrand de BeuvronF, et al. . BNO—An Ontology for Understanding the Transittability of Complex Biomolecular Networks. Journal of Web Semantics, 2019, 57: 100495
CrossRef Google scholar
BentonM J. Stems, Nodes, Crown Clades, and Rank-Free Lists: Is Linnaeus Dead?. Biological Reviews, 2000, 75(4): 633-648
CrossRef Google scholar
BrowerA V, SchuhR T. Biological Systematics: Principles and Applications, 2021 Ithaca, NY Cornell University Press 435
CrossRef Google scholar
CoxS J D, RichardS M. A Formal Model for the Geologic Time Scale and Global Stratotype Section and Point, Compatible with Geospatial Information Transfer Standards. Geosphere, 2005, 1(3): 119-137
CrossRef Google scholar
EllisB F, MessinaA R. Catalogue of Foraminifera. American Museum of Natural History, 1942 New York Micropaleontology Press
EllisB F, MessinaA R. Catalogue of Ostracoda. American Museum of Natural History, 1952 New York Micropaleontology Press
FallahiG R, FrankA U, MesgariM S, et al. . An Ontological Structure for Semantic Interoperability of GIS and Environmental Modeling. International Journal of Applied Earth Observation and Geoinformation, 2008, 10(3): 342-357
CrossRef Google scholar
GruberT R. A Translation Approach to Portable Ontology Specifications. Knowledge Acquisition, 1993, 5(2): 199-220
CrossRef Google scholar
GruberT R. Toward Principles for the Design of Ontologies Used for Knowledge Sharing?. International Journal of Human-Computer Studies, 1995, 43(5/6): 907-928
CrossRef Google scholar
GuanN N, SongD D, LiaoL J. Knowledge Graph Embedding with Concepts. Knowledge-Based Systems, 2019, 164: 38-44
CrossRef Google scholar
GutierrezC, SequedaJ F. Knowledge Graphs. Communications of the ACM, 2021, 64(3): 96-104
CrossRef Google scholar
HinchliffC E, SmithS A, AllmanJ F, et al. . Synthesis of Phylogeny and Taxonomy into a Comprehensive Tree of Life. Proceedings of the National Academy of Sciences of the United States of America, 2015, 112(41): 12764-12769
CrossRef Google scholar
Hou, C. B., Liu, K. C., Wang, T. H., et al., 2024. DDE KG Editor: A Data Service System for Knowledge Graph Construction in Geoscience. Geoscience Data Journal. https://doi.org/10.1002/gdj3.245
HuX M, XuY W, MaX G, et al. . Knowledge System, Ontology, and Knowledge Graph of the Deep-Time Digital Earth (DDE): Progress and Perspective. Journal of Earth Science, 2023, 34(5): 1323-1327
CrossRef Google scholar
LinnaeusC. Species Plantarum, 1753
LinnaeusC. Systema Naturae, Sive Regna Tria Naturae Systematice Proposita per Classes, Ordines, Genera, & Species, 1758 10 Leiden Haak
MaX G. Knowledge Graph Construction and Application in Geosciences: A Review. Computers & Geosciences, 2022, 161: 105082
CrossRef Google scholar
MaX G, WuC L, CarranzaE J M, et al. . Development of a Controlled Vocabulary for Semantic Interoperability of Mineral Exploration Geodata for Mining Projects. Computers & Geosciences, 2010, 36(12): 1512-1522
CrossRef Google scholar
NechesR, FikesR, FininT, et al. . Enabling Technology for Knowledge Sharing. AI Magazine, 1991, 12(3): 36-56
QiuQ J, WangB, MaK, et al. . A Practical Approach to Constructing a Geological Knowledge Graph: A Case Study of Mineral Exploration Data. Journal of Earth Science, 2023, 34(5): 1374-1389
CrossRef Google scholar
QiuZ X, ZhangM M, WuX Z. Palaeovertebrata Sinica, 2015 Beijing Science Press (in Chinese with English Abstract)
RavikumarK E, WagholikarK B, LiuH F. Towards Pathway Curation through Literature Mining: A Case Study Using PharmGKB. Pacific Symposium on Biocomputing, 2014, 2014: 352-363
RedelingsB D, HolderM T. A Supertree Pipeline for Summarizing Phylogenetic and Taxonomic Information for Millions of Species. PeerJ, 2017, 5: e3058
CrossRef Google scholar
RuggieroM A, GordonD P, OrrellT M, et al. . A Higher Level Classification of all Living Organisms. PLoS One, 2015, 10(4): e0119248
CrossRef Google scholar
SancettaC A. Catalogue of Diatoms, 1985 New York Micropaleontology Press
Scott-RamN R. Transformed Cladistics, Taxonomy and Evolution, 1990 Cambridge Cambridge University Press 5-36
CrossRef Google scholar
SeldenP A. Treatise on Invertebrate Paleontology: A Work in Progress. PALAIOS, 2012, 27(7): 439-442
CrossRef Google scholar
ShenS Z, FanJ X, WangX D, et al. . How to Build a High-Resolution Digital Geological Timescale?. Journal of Earth Science, 2022, 33(6): 1629-1632
CrossRef Google scholar
SinghalA. Introducing the Knowledge Graph: Things, not Strings, 2012 (2012-5-16)
SmithS A, BrownJ W, HinchliffC E. Analyzing and Synthesizing Phylogenies Using Tree Alignment Graphs. PLoS Computational Biology, 2013, 9(9): e1003223
CrossRef Google scholar
VermaA K, PrakashS. Status of Animal Phyla in Different Kingdom Systems of Biological Classification. International Journal of Biological Innovations, 2020, 2(2): 149-154
CrossRef Google scholar
WangC B, MaX G, ChenJ G, et al. . Information Extraction and Knowledge Graph Construction from Geoscience Literature. Computers and Geosciences, 2018, 112: 112-120
CrossRef Google scholar
WangC S, HazenR M, ChengQ M, et al. . The Deep-Time Digital Earth Program: Data-Driven Discovery in Geosciences. National Science Review, 2021, 8(9): nwab027
CrossRef Google scholar
WangZ, ZhangJ W, FengJ L, et al. . Knowledge Graph Embedding by Translating on Hyperplanes. Proceedings of the AAAI Conference on Artificial Intelligence, 2014, 28(1): 1112-1119
CrossRef Google scholar
XiJ L, WuJ, WuM B. Design and Construction of Lightweight Domain Ontology of Tectonic Geomorphology. Journal of Earth Science, 2023, 34(5): 1350-1357
CrossRef Google scholar
ZhangL N, HouZ S, ShenB H, et al. . Paleobiogeographic Knowledge Graph: An Ongoing Work with Fundamental Support for Future Research. Journal of Earth Science, 2023, 34(5): 1339-1349
CrossRef Google scholar
ZhouZ Y, SunG, WangJ, et al. . Palaeobotanica Sinica, 2020 Beijing Science Press (in Chinese with English Abstract)
ZhuY Q, ZhouW W, XuY, et al. . Intelligent Learning for Knowledge Graph towards Geological Data. Scientific Programming, 2017, 2017(1): 5072427

Accesses

Citations

Detail

Sections
Recommended

/