How performance metric choice influences individual tree mortality model selection
Aitor Vázquez-Veloso , Andrés Núñez-Bravo , Astor Toraño-Caicoya , Hans Pretzsch , Felipe Bravo
Journal of Forestry Research ›› 2026, Vol. 37 ›› Issue (1) : 49
How performance metric choice influences individual tree mortality model selection
Understanding tree mortality is crucial to understand forest dynamics and is essential for growth models and simulators. Although factors such as competition, drought, and pathogens drive mortality, their underlying mechanisms remain difficult to model. While substantial attention has focused on selecting appropriate algorithms and covariates, evaluating individual tree mortality models also requires careful selection of performance criteria. This study compares seven different metrics to assess their impact on model evaluation and selection. Results show that candidate models exhibited varying performances across metrics and that the choice of metric significantly influences the selection of the best model. When no confusion matrix was available, the area under the precision-recall curve (AUCPR) emerged as a more reliable alternative to the area under the ROC curve (AUC), offering a more informative assessment for imbalanced datasets. When a confusion matrix was available, Cohen’s Kappa coefficient (K) and Matthews correlation coefficient (MCC) outperformed accuracy-based metrics, providing a fairer evaluation of both live and dead tree classifications. These findings emphasize the importance of choosing appropriate evaluation standards to enhance mortality model assessment and ensure reliable predictions in forestry applications.
Forest modeling / Survival / Binary classification / Area under the precision-recall curve (AUCPR) / Mathews correlation coefficient (MCC)
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
Bravo F, Bravo-Núñez A (2017) Clasificación de la calidad de estación forestal mediante técnicas de aprendizaje automático (machine learning). In: Actas 7o Congreso Forestal Español. Sociedad Española de Ciencias Forestales, Cáceres. (in Spanish). https://www.congresoforestal.es/fichero.php?t=41725&i=5686&m=2185 |
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
Gamer M, Lemon J, Puspendra Singh IF (2019) irr: Various Coefficients of interrater reliability and agreement. CRAN. https://doi.org/10.32614/CRAN.package.irr |
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
Jeni LA, Cohn JF, De La Torre F (2013) Facing imbalanced data—recommendations for the use of performance metrics. In: 2013 humaine association conference on affective computing and intelligent interaction. Geneva, Switzerland. IEEE. https://doi.org/10.1109/acii.2013.47 |
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
Merkl D, Hasenauer H (1998) Using neural networks to predict individual tree mortality. In: Proceedings of the int’l conference on engineering applications of neural networks, pp 10–12 |
| [27] |
Naidu G, Zuva T, Sibanda EM (2023) A review of evaluation metrics in machine learning algorithms. In: Artificial intelligence application in networks and systems, Springer International Publishing, Cham, pp 15–25. https://doi.org/10.1007/978-3-031-35314-7_2 |
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
R Core Team (2021) R: a language and environment for statistical computing. R foundation for statistical computing. Vienna, Austria. https://www.R-project.org/ |
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
The Author(s)
/
| 〈 |
|
〉 |