Real-time instance segmentation of tree trunks from under-canopy images in complex forest environments

Chong Mo; Wenlong Song; Weigang Li; Guanglai Wang; Yongkang Li; Jianping Huang

doi:10.1007/s11676-025-01825-y

Journal of Forestry Research ›› 2025, Vol. 36 ›› Issue (1) : 0. DOI: 10.1007/s11676-025-01825-y

Original Paper

Real-time instance segmentation of tree trunks from under-canopy images in complex forest environments

Author information +

History +

Abstract

Tree trunk instance segmentation is crucial for under-canopy unmanned aerial vehicles (UAVs) to autonomously extract standing tree stem attributes. Using cameras as sensors makes these UAVs compact and lightweight, facilitating safe and flexible navigation in dense forests. However, their limited onboard computational power makes real-time, image-based tree trunk segmentation challenging, emphasizing the urgent need for lightweight and efficient segmentation models. In this study, we present RT-Trunk, a model specifically designed for real-time tree trunk instance segmentation in complex forest environments. To ensure real-time performance, we selected SparseInst as the base framework. We incorporated ConvNeXt-T as the backbone to enhance feature extraction for tree trunks, thereby improving segmentation accuracy. We further integrate the lightweight convolutional block attention module (CBAM), enabling the model to focus on tree trunk features while suppressing irrelevant information, which leads to additional gains in segmentation accuracy. To enable RT-Trunk to operate effectively under diverse complex forest environments, we constructed a comprehensive dataset for training and testing by combining self-collected data with multiple public datasets covering different locations, seasons, weather conditions, tree species, and levels of forest clutter. Compared with the other tree trunk segmentation methods, the RT-Trunk method achieved an average precision of 91.4% and the fastest inference speed of 32.9 frames per second. Overall, the proposed RT-Trunk provides superior trunk segmentation performance that balances speed and accuracy, making it a promising solution for supporting under-canopy UAVs in the autonomous extraction of standing tree stem attributes. The code for this work is available at https://github.com/NEFU-CVRG/RT-Trunk.

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Chong Mo, Wenlong Song, Weigang Li, Guanglai Wang, Yongkang Li, Jianping Huang. Real-time instance segmentation of tree trunks from under-canopy images in complex forest environments. Journal of Forestry Research, 2025, 36(1): 0 https://doi.org/10.1007/s11676-025-01825-y

This is a preview of subscription content, contact us for subscripton.

References

Publishing order | Descend order by publishing year | Descend order by cited within

Bolya D, Zhou C, Xiao FY, Lee YJ (2019) YOLACT: real-time instance segmentation. In: In: 2019 IEEE/CVF international conference on computer vision (ICCV). Seoul, Korea (South). IEEE, pp 9156–9165

Chen

, Nardari

, Lee

, Qu

, Liu

, Romero

RAF

, Kumar

. SLOAM: semantic lidar odometry and mapping for forest inventory. IEEE Robot Autom Lett, 2020, 5(2): 612-619.

CrossRef Google scholar

Cheng TH, Wang XG, Chen SY, Zhang WQ, Zhang Q, Huang C, Zhang ZX, Liu WY (2022) Sparse instance activation for real-time instance segmentation. In: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR). New Orleans, LA, USA. IEEE, pp 4423–4432

Chisholm

, Rodríguez-Ronderos

, Lin

. Estimating tree diameters from an autonomous below-canopy UAV with mounted LiDAR. Remote Sens, 2021, 13(13): 2576.

CrossRef Google scholar

Czuni L, Ben Alaya K (2019) Low- and high-level methods for tree segmentation. In: 2019 10th IEEE international conference on intelligent data acquisition and advanced computing systems: technology and applications (IDAACS). Metz, France. IEEE. https://doi.org/10.1109/idaacs.2019.8924248

Czúni L, Kürtösi A, Ben Alaya K (2018) Color based clustering for trunk segmentation. In: 2018 25th international conference on systems, signals and image processing (IWSSIP). Maribor, Slovenia. IEEE, pp 1–4

da Silva

, Dos Santos

, Sousa

, Filipe

. Visible and thermal image-based trunk detection with deep learning for forestry mobile robotics. J Imaging, 2021, 7(9): 176. 8468268

CrossRef Pubmed Google scholar

da Silva

, dos Santos

, Filipe

, Sousa

, Oliveira

. Edge AI-based tree trunk detection for forestry monitoring robotics. Robotics, 2022, 11(6): 136.

CrossRef Google scholar

de Paula

, Olofsson

, Persson

, Lindberg

, Holmgren

. Individual tree detection and estimation of stem attributes with mobile laser scanning along boreal forest roads. ISPRS J Photogr Remote Sens, 2022, 187: 211-224.

CrossRef Google scholar

Dong

, Roy

, Isler

. Semantic mapping for orchard environments by merging two-sides reconstructions of tree rows. J Field Robot, 2020, 37(1): 97-121.

CrossRef Google scholar

Fan

, Feng

, Shen

, Khan

, Mannan

, Gao

, Chen

, Saeed

. A trunk-based SLAM backend for smartphones with online SLAM in large-scale forest inventories. ISPRS J Photogr Remote Sens, 2020, 162: 41-49.

CrossRef Google scholar

Fortin JM, Gamache O, Grondin V, Pomerleau F, Giguère P (2022) Instance segmentation for autonomous log grasping in forestry operations. In: In: 2022 IEEE/RSJ international conference on intelligent robots and systems (IROS). Kyoto, Japan. IEEE, pp 6064–6071.

Gogoi

, Ahirwal

, Sahoo

. Evaluation of ecosystem carbon storage in major forest types of Eastern Himalaya: Implications for carbon sink management. J Environ Manage, 2022, 302(Pt A): 113972.

CrossRef Pubmed Google scholar

Grondin

, Fortin

, Pomerleau

, Giguère

. Tree detection and diameter estimation based on deep learning. Forestry (Lond), 2023, 96(2): 264-276.

CrossRef Google scholar

He KM, Zhang XY, Ren SQ, Sun J (2016) Deep residual learning for image recognition. In: In: 2016 IEEE conference on computer vision and pattern recognition (CVPR). Las Vegas, NV, USA. IEEE, pp 770–778

He KM, Gkioxari G, Dollár P, Girshick R (2017) Mask R-CNN. In: 2017 IEEE international conference on computer vision (ICCV). Venice, Italy. IEEE, pp 2980–2988.

Hyyppä

, Yu

, Kaartinen

, Hakala

, Kukko

, Vastaranta

, Hyyppä

. Comparison of backpack, handheld, under-canopy UAV, and above-canopy UAV laser scanning for field reference data collection in boreal forests. Remote Sens, 2020, 12(20): 3327.

CrossRef Google scholar

Juman

, Wong

, Rajkumar

, Goh

. A novel tree trunk detection method for oil-palm plantation navigation. Comput Electron Agric, 2016, 128: 172-180.

CrossRef Google scholar

Karjalainen K (2023) Image segmentation for forestry scenes. Electrical Engineering

Ke L, Ye MQ, Danelljan M, Liu YF, Tai YW, Tang CK, Yu F, Ke L, Ye MQ, Danelljan M, Liu YF, Tai YW, Tang CK, Yu F (2024) Segment anything in high quality. In: Proceedings of the 37th international conference on neural information processing systems. New Orleans, LA, USA. ACM, pp 29914–29934

Earthshot Labs (2022) Tree binary segmentation. Accessed at: https://www.kaggle.com/datasets/earthshot/tree-binary-segmentation

Lagos J, Lempiö U, Rahtu E (2023) FinnWoodlands dataset. In: Image analysis. Springer Nature Switzerland, pp 95–110. https://doi.org/10.1007/978-3-031-31435-3_7

, Sun

, Wang

, Tan

, Xu

. Tree trunk detection in urban scenes using a multiscale attention-based deep learning method. Ecol Inform, 2023, 77: 102215.

CrossRef Google scholar

Liang

, Kankare

, Yu

, Hyyppä

, Holopainen

. Automated stem curve measurement using terrestrial laser scanning. IEEE Trans Geosci Remote Sens, 2014, 52(3): 1739-1748.

CrossRef Google scholar

Liang

, Kankare

, Hyyppä

, Wang

, Kukko

, Haggrén

, Yu

, Kaartinen

, Jaakkola

, Guan

, Holopainen

, Vastaranta

. Terrestrial laser scanning in forest inventories. ISPRS J Photogr Remote Sens, 2016, 115: 63-77.

CrossRef Google scholar

Liang

, Yao

, Qi

, Wang

. Forest in situ observations through a fully automated under-canopy unmanned aerial vehicle. Geo Spatial Inf Sci, 2024, 27(4): 983-999.

CrossRef Google scholar

Liu

, Wang

. Classification of tree species and stock volume estimation in ground forest images using deep learning. Comput Electron Agric, 2019, 166: 105012.

CrossRef Google scholar

Liu

, Chen

, Nardari

, Qu

, Cladera

, Taylor

, Kumar

. Challenges and opportunities for autonomous micro-UAVs in precision agriculture. IEEE Micro, 2022, 42(1): 61-68.

CrossRef Google scholar

Liu Z, Mao HZ, Wu CY, Feichtenhofer C, Darrell T, Xie SN (2022) A ConvNet for the 2020s. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). New Orleans, LA, USA. IEEE, pp 11966–11976

Lu Y, Rasmussen C (2011) Tree trunk detection using contrast templates. In: 2011 18th IEEE international conference on image processing. Brussels, Belgium. IEEE, pp 1253–1256

Niknejad

, Bidese-Puhl

, Bao

, Payn

, Zheng

. Phenotyping of architecture traits of loblolly pine trees using stereo machine vision and deep learning: Stem diameter, branch angle, and branch diameter. Comput Electron Agric, 2023, 211: 107999.

CrossRef Google scholar

Prabhu

, Liu

, Spasojevic

, Wu

, Shao

, Ong

, Lei

, Green

, Chaudhari

, Kumar

. UAVs for forestry: Metric-semantic mapping and diameter estimation with autonomous aerial robots. Mech Syst Signal Process, 2024, 208: 111050.

CrossRef Google scholar

Shi

, Wang

, Mo

, Yi

, Wu

. Automatic segmentation of standing trees from forest images based on deep learning. Sensors, 2022, 22(17): 6663. 9460454

CrossRef Pubmed Google scholar

Tremblay

, Béland

, Gagnon

, Pomerleau

, Giguère

. Automatic three-dimensional mapping for tree diameter measurements in inventory operations. J Field Robot, 2020, 37(8): 1328-1346.

CrossRef Google scholar

Wang XL, Zhang RF, Kong T, Li L, Shen CH, Wang XL, Zhang RF, Kong T, Li L, Shen CH (2020) SOLOv2. In: Proceedings of the 34th international conference on neural information processing systems. Vancouver, BC, Canada. ACM, pp 17721–17732

Wells

, Chung

. Real-time computer vision for tree stem detection and tracking. Forests, 2023, 14(2): 267.

CrossRef Google scholar

Woo S, Park J, Lee JY, Kweon IS (2018) CBAM: convolutional block attention module. In: Computer vision–ECCV 2018. Springer International Publishing, pp 3–19. https://doi.org/10.1007/978-3-030-01234-2_1

Zhou

, Yang

, Liu

, Li

, Xie

, Peng

. Dynamic allometric scaling of tree biomass and size. Nat Plants, 2021, 7(1): 42-49.

CrossRef Pubmed Google scholar