Classification of forest vegetation with the application of iterative reallocation and model-based clustering
Naghmeh Pakgohar , Javad Eshaghi Rad , Hossein Gholami , Ahmad Alijanpour , David W. Roberts , Attila Lengyel , Enrico Feoli
Classification of forest vegetation with the application of iterative reallocation and model-based clustering
Numerous clustering algorithms are valuable in pattern recognition in forest vegetation, with new ones continually being proposed. While some are well-known, others are underutilized in vegetation science. This study compares the performance of practical iterative reallocation algorithms with model-based clustering algorithms. The data is from forest vegetation in Virginia (United States), the Hyrcanian Forest (Asia), and European beech forests. Practical iterative reallocation algorithms were applied as non-hierarchical methods and Finite Gaussian mixture modeling was used as a model-based clustering method. Due to limitations on dimensionality in model-based clustering, principal coordinates analysis was employed to reduce the dataset’s dimensions. A log transformation was applied to achieve a normal distribution for the pseudo-species data before calculating the Bray–Curtis dissimilarity. The findings indicate that the reallocation of misclassified objects based on silhouette width (OPTSIL) with Flexible-β (– 0.25) had the highest mean among the tested clustering algorithms with Silhouette width 1 (REMOS1) with Flexible-β (– 0.25) second. However, model-based clustering performed poorly. Based on these results, it is recommended using OPTSIL with Flexible-β (– 0.25) and REMOS1 with Flexible-β (– 0.25) for forest vegetation classification instead of model-based clustering particularly for heterogeneous datasets common in forest vegetation community data.
Classification / Heuristic clustering / Finite mixture / Forest ecosystems / Model-based clustering
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
Feoli E, Orlóci L (1991) The properties and interpretation of observations in vegetation study. In: Computer assisted vegetation analysis. Springer, Dordrecht, pp 1–13. https://doi.org/10.1007/978-94-011-3418-7_1 |
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
Grun B (2018) Model-based clustering. Handbook of mixture analysis. In: Frühwirth-Schnatter S, Celeux G, Robert CP (eds) Chapman and Hall/CRC, pp 163–198 |
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
McCune B, Grace JB (2002) Analysis of ecological communities. MJM Software Design, Glenedon Beach |
| [38] |
Mielke PW Jr, Berry KJ (2007) Permutation methods: a distance function approach, 2nd edn. Springer, New York. https://doi.org/10.1007/978-0-387-69813-7 |
| [39] |
Mucina L, van der Maarel E (1989) Twenty years of numerical syntaxonomy. In: Numerical syntaxonomy. Springer, Amsterdam, pp 1–15. https://doi.org/10.1007/978-94-009-2432-1_1 |
| [40] |
|
| [41] |
Oksanen J, Blanchet FG, Kindt R, Legendre P, Minchin PR, O’hara RB, Simpson GL, Sólymos P, Stevens MHH, Wagner H (2022). Package ‘vegan’. Community ecology package, version 2(9), pp 1–295 |
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
R Core Team (2021) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria URL https://www.R-project.org/ |
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
Roberts DW (2022) Optpart: optimal partitioning of similarity relations. R package version, 2-0. |
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
| [61] |
|
| [62] |
|
| [63] |
|
| [64] |
Zhang YF, Horvath S, Ophoff R, Telesca D (2014) Comparison of clustering methods for time course genomic data: applications to aging effects. https://doi.org/10.48550/arXiv.1404.7534 |
| [65] |
|
| [66] |
|
| [67] |
Zuur AF, Ieno EN, Walker NJ, Saveliev AA, Smith GM (2009) Mixed effects models and extensions in ecology with R. Statistics for Biology and Health. Springer, New York, USA |
Northeast Forestry University
/
| 〈 |
|
〉 |