Forestry big data platform by Knowledge Graph
Mengxi Zhao , Dan Li , Yongshen Long
Journal of Forestry Research ›› 2020, Vol. 32 ›› Issue (3) : 1305 -1314.
Forestry big data platform by Knowledge Graph
Using the advantages of web crawlers in data collection and distributed storage technologies, we accessed to a wealth of forestry-related data. Combined with the mature big data technology at its present stage, Hadoop’s distributed system was selected to solve the storage problem of massive forestry big data and the memory-based Spark computing framework to realize real-time and fast processing of data. The forestry data contains a wealth of information, and mining this information is of great significance for guiding the development of forestry. We conducts co-word and cluster analyses on the keywords of forestry data, extracts the rules hidden in the data, analyzes the research hotspots more accurately, grasps the evolution trend of subject topics, and plays an important role in promoting the research and development of subject areas. The co-word analysis and clustering algorithm have important practical significance for the topic structure, research hotspot or development trend in the field of forestry research. Distributed storage framework and parallel computing have greatly improved the performance of data mining algorithms. Therefore, the forestry big data mining system by big data technology has important practical significance for promoting the development of intelligent forestry.
Intelligent forestry / Co-word analysis / Knowledge Graph / Big data
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
Dong XL, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K (2014) Knowledge vault: a web-scale approach to probabilistic knowledge fusion, pp 601–610. https://doi.org/10.1145/2623330.2623623 |
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
Groc CD (2011) Babouk: focused web crawling for corpus compilation and automatic terminology extraction. In: IEEE/WIC/ACM international conference on web intelligence and intelligent agent technology, vol 1, pp 497–498 |
| [10] |
Gu Y, Ji L, Jiang Z, He J (2012) Endless and scalable knowledge table extraction from semi-structured websites. In: IEEE, international conference on data mining workshops. IEEE Computer Society, pp 835–842 |
| [11] |
|
| [12] |
Jiang L, Li B, Song M (2011) The optimization of HDFS based on small files. In: IEEE international conference on broadband network and multimedia technology, pp 912–915 |
| [13] |
Jie W, Yang S, Wang Y, Cheng H (2016) The crawling and analysis of agricultural products big data based on Jsoup. In: International conference on fuzzy systems & knowledge discovery. https://doi.org/10.1109/FSKD.2015.7382112 |
| [14] |
Li Z (2012) Parsing the internal structure of words: a new paradigm for Chinese word segmentation. Meeting of the Association for Computational Linguistics: Human Language Technologies |
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
Sun W (2012) A stacked sub-word model for joint Chinese word segmentation and part-of-speech tagging. Meeting of the Association for Computational Linguistics: Human Language Technologies. https://doi.org/10.2514/6.2006-1683 |
| [24] |
|
| [25] |
Wu B, Zhang M, Zeng H, Zhang X, Yan N, Meng J (2016) Agricultural monitoring and early warning in the era of big data. J Remote Sens 20(5) |
| [26] |
|
| [27] |
Zhao Y, Li G, Li F, Li D (2011) The establishment and application of spatial decisions support systems of digital forestry. International Conference on Geoinformatics. https://doi.org/10.1109/CTS.2014.6867550 |
| [28] |
|
/
| 〈 |
|
〉 |