DLFE-YOLO: an enhanced framework for underwater object detection based on YOLOv8
Yuhan Lin , Yuanyuan Zhang , Dongsen Yang , Yukangping Zhou , Longtao Wang
Intelligent Marine Technology and Systems ›› 2025, Vol. 3 ›› Issue (1) : 37
DLFE-YOLO: an enhanced framework for underwater object detection based on YOLOv8
Underwater object detection remains challenging because of low visibility, complex lighting, and varied target scales, which limit the effectiveness of conventional deep learning methods. To overcome these limitations, this study proposes DLFE-You Only Look Once (YOLO), a novel YOLOv8-based framework that enhances feature representation and detection robustness in underwater environments. Its key innovations include its improved attention mechanisms and adaptive feature modeling to better capture irregular and small targets, as well as a loss function that emphasizes difficult-to-detect objects. Extensive experiments on two public benchmarks (URPC2020 and URPC2018) demonstrate that the proposed framework consistently outperforms state-of-the-art detectors, achieving mAP@0.5 values of 84.4% and 77.9%, respectively. Transfer learning experiments further indicate that DLFE-YOLO maintains strong adaptability across datasets, effectively leveraging knowledge from larger datasets to improve detection in smaller or less diverse underwater domains. These results highlight the model’s ability to address the shortcomings of existing methods. It improves both detection accuracy and generalization and provides a practical solution for intelligent underwater perception systems.
Underwater object detection / ResNet / YOLO / Loss function
| [1] |
Alam M, Khan IR, Siddiqui F, Alam MA (2024) Artificial intelligence as key enabler for safeguarding the marine resources. In: De D et al (eds) Artificial intelligence and edge computing for sustainable ocean health. Springer, Cham, pp 409–451. https://doi.org/10.1007/978-3-031-64642-3_18 |
| [2] |
|
| [3] |
Bajpai V, Sharma A, Subudhi BN, Veerakumar T, Jakhetiya V (2021) Underwater U-Net: deep learning with U-Net for visual underwater moving object detection. In: OCEANS 2021: San Diego–Porto. IEEE, pp 1–4. https://doi.org/10.23919/OCEANS44145.2021.9705761 |
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
He KM, Zhang XY, Ren SQ, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, pp 770–778. https://doi.org/10.1109/CVPR.2016.90 |
| [15] |
|
| [16] |
|
| [17] |
Li H, Wu AY, Jiang ZY, Liu F, Luo MY (2024) Improving object detection in YOLOv8n with the C2f-f module and multi-scale fusion reconstruction. In: 2024 IEEE 6th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC). IEEE, pp 374–379. https://doi.org/10.1109/IMCEC59810.2024.10575292 |
| [18] |
|
| [19] |
|
| [20] |
Liang XT, Song PH (2022) Excavating RoI attention for underwater object detection. In: 2022 IEEE International Conference on Image Processing (ICIP). IEEE, pp 2651–2655. https://doi.org/10.1109/ICIP46576.2022.9897515 |
| [21] |
Liu FH, Fei YH, Su M, Zang R, Sun XF, Wang GC (2024a) A novel lightweight model for underwater small object detection. In: 2024 China Automation Congress (CAC). IEEE, pp 2226–2231. https://doi.org/10.1109/CAC63892.2024.10865529 |
| [22] |
Liu HK, Luo XN, Li F (2024b) YOLOv5 garbage target detection based on lightweight convolution in marine environment. In: 2024 International Symposium on Digital Home (ISDH). IEEE, pp 31–36. https://doi.org/10.1109/ISDH64927.2024.00012 |
| [23] |
Liu JJ, Liu ZT, Wang YP, Li F (2024c) VVNet: underwater object detection network based on vision Transformer and vision retnet for underwater robot picking. In: 2024 International Symposium on Digital Home (ISDH). IEEE, pp 19–24. https://doi.org/10.1109/ISDH64927.2024.00010 |
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, pp 779–788. https://doi.org/10.1109/CVPR.2016.91 |
| [28] |
|
| [29] |
|
| [30] |
Varghese R, Sambath M (2023) A comprehensive review on two-stage object detection algorithms. In: 2023 International Conference on Quantum Technologies, Communications, Computing, Hardware and Embedded Systems Security (iQ-CCHESS). IEEE, pp 1–7. https://doi.org/10.1109/iQ-CCHESS56596.2023.10391506 |
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
Yaseen M (2024) What is YOLOv8: an in-depth exploration of the internal features of the next-generation object detector. Preprint at arXiv:2408.15857. |
| [38] |
Yu JH, Jiang YN, Wang ZY, Cao ZM, Huang T (2016) UnitBox: an advanced object detection network. In: Proceedings of the 24th ACM International Conference on Multimedia. ACM, pp 516–520. https://doi.org/10.1145/2964284.2967274 |
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
Zhao D, Yang B, Dou YK, Guo XJ (2022) Underwater fish detection in sonar image based on an improved faster RCNN. In: 2022 9th International Forum on Electrical Engineering and Automation (IFEEA). IEEE, pp 358–363. https://doi.org/10.1109/IFEEA57288.2022.10038226 |
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
Zhou JJ, Xu DM, Min XY, Wu D (2024b) An improved underwater target detection algorithm based on YOLOX. In: OCEANS 2024–Singapore. IEEE, pp 1–7. https://doi.org/10.1109/OCEANS51537.2024.10682303 |
| [49] |
|
| [50] |
Zhu HG, Luo XN, Zhai ZY, Li J, Li F, Zhu XS (2024) UNet-based YOLO for underwater object detection. In: 2024 IEEE First International Conference on Data Intelligence and Innovative Application (DIIA). IEEE, pp 1–7. https://doi.org/10.1109/DIIA62678.2024.10871910 |
| [51] |
|
The Author(s)
/
| 〈 |
|
〉 |