Real-time instance segmentation of tree trunks from under-canopy images in complex forest environments
Chong Mo, Wenlong Song, Weigang Li, Guanglai Wang, Yongkang Li, Jianping Huang
Journal of Forestry Research, 2025, Vol. 36, Issue 1
Tree trunk instance segmentation is crucial for under-canopy unmanned aerial vehicles (UAVs) to autonomously extract standing tree stem attributes. Using cameras as sensors keeps these UAVs compact and lightweight, which facilitates safe and flexible navigation in dense forests. However, their limited onboard computational power makes real-time, image-based tree trunk segmentation challenging and creates an urgent need for lightweight, efficient segmentation models. In this study, we present RT-Trunk, a model specifically designed for real-time tree trunk instance segmentation in complex forest environments. To ensure real-time performance, we selected SparseInst as the base framework. We incorporated ConvNeXt-T as the backbone to enhance feature extraction for tree trunks, thereby improving segmentation accuracy. We further integrated the lightweight convolutional block attention module (CBAM), which enables the model to focus on tree trunk features while suppressing irrelevant information and yields additional gains in segmentation accuracy. To enable RT-Trunk to operate effectively across diverse, complex forest environments, we constructed a comprehensive dataset for training and testing by combining self-collected data with multiple public datasets covering different locations, seasons, weather conditions, tree species, and levels of forest clutter. Compared with other tree trunk segmentation methods, RT-Trunk achieved an average precision of 91.4% and the fastest inference speed, 32.9 frames per second. Overall, RT-Trunk balances speed and accuracy in trunk segmentation, making it a promising solution for supporting under-canopy UAVs in the autonomous extraction of standing tree stem attributes. The code for this work is available at https://github.com/NEFU-CVRG/RT-Trunk.
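To make the attention step concrete, the following is a minimal pure-Python sketch of a generic CBAM block: channel attention followed by spatial attention on a C x H x W feature map. It is not taken from the RT-Trunk repository. The function names, the random weights, the reduction ratio `r`, the 7 x 7 kernel size, and the toy tensor shape are all assumptions chosen for illustration, and the abstract does not state where in the network the module is placed.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat, w1, w2):
    """Return one weight per channel of a C x H x W feature map (nested lists)."""
    # Squeeze each channel with global average pooling and global max pooling.
    avg = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in feat]
    mx = [max(map(max, ch)) for ch in feat]

    def shared_mlp(v):
        # Bottleneck MLP (C -> C/r -> C) with ReLU, shared by both descriptors.
        hidden = [max(0.0, sum(w * x for w, x in zip(row, v))) for row in w1]
        return [sum(w * h for w, h in zip(row, hidden)) for row in w2]

    return [sigmoid(a + m) for a, m in zip(shared_mlp(avg), shared_mlp(mx))]


def spatial_attention(feat, kernel):
    """Return an H x W weight map from a k x k convolution over two pooled maps."""
    C, H, W = len(feat), len(feat[0]), len(feat[0][0])
    k = len(kernel[0])
    pad = k // 2
    # Pool along the channel axis: per-pixel mean and per-pixel max.
    mean_map = [[sum(feat[c][i][j] for c in range(C)) / C for j in range(W)] for i in range(H)]
    max_map = [[max(feat[c][i][j] for c in range(C)) for j in range(W)] for i in range(H)]
    maps = [mean_map, max_map]
    out = []
    for i in range(H):
        row = []
        for j in range(W):
            acc = 0.0
            for m in range(2):
                for di in range(k):
                    for dj in range(k):
                        y, x = i + di - pad, j + dj - pad
                        if 0 <= y < H and 0 <= x < W:  # zero padding at the borders
                            acc += kernel[m][di][dj] * maps[m][y][x]
            row.append(sigmoid(acc))
        out.append(row)
    return out


def cbam(feat, w1, w2, kernel):
    """Apply channel attention, then spatial attention, to the feature map."""
    mc = channel_attention(feat, w1, w2)
    refined = [[[mc[c] * v for v in row] for row in ch] for c, ch in enumerate(feat)]
    ms = spatial_attention(refined, kernel)
    return [[[ms[i][j] * v for j, v in enumerate(row)] for i, row in enumerate(ch)]
            for ch in refined]


# Toy example with random (untrained, hypothetical) weights.
random.seed(0)
C, H, W, r, k = 4, 5, 6, 2, 7
feat = [[[random.uniform(-1, 1) for _ in range(W)] for _ in range(H)] for _ in range(C)]
w1 = [[random.uniform(-0.5, 0.5) for _ in range(C)] for _ in range(C // r)]
w2 = [[random.uniform(-0.5, 0.5) for _ in range(C // r)] for _ in range(C)]
kernel = [[[random.uniform(-0.5, 0.5) for _ in range(k)] for _ in range(k)] for _ in range(2)]
out = cbam(feat, w1, w2, kernel)
```

Because both attention maps pass through a sigmoid, every output value is the input value scaled by a factor between 0 and 1. This is the mechanism by which responses at background locations can be damped relative to trunk features. A trained model would learn the weights rather than sample them, and a practical implementation would use a tensor library instead of nested lists.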