Road target detection algorithm based on improved YOLOv5 in UAV images

Yi ZHANG , Ronggui MA , Chen LIANG

Journal of Measurement Science and Instrumentation ›› 2024, Vol. 15 ›› Issue (1) : 128 -139.

PDF (7705KB)
Journal of Measurement Science and Instrumentation ›› 2024, Vol. 15 ›› Issue (1) :128 -139. DOI: 10.62756/jmsi.1674-8042.2024013
Test and detection technology
research-article

Road target detection algorithm based on improved YOLOv5 in UAV images

Author information +
History +
PDF (7705KB)

Abstract

Aiming at the problems such as low accuracy and poor robustness of target detection caused by missed detection of small road targets and occlusion between targets in UAV images, an improved road target detection algorithm based on YOLOv5 combining convolutional block attention module(CBAM), called YOLOv5s-FCC, was proposed. Firstly, a small target sensing layer was introduced to improve the multi-scale model, and a small target YOLO detection head was added to improve the feature extraction ability of the network for small road targets. Secondly, the CBAM fused space and channel information to enhance important information in the network after it was introduced into different locations of the Backbone network to obtain the best fusion location of CBAM. Finally, CIoU loss function was used to improve the speed and accuracy of the calculation required for predicting the bounding box of image. The experimental results showed that compared with YOLOv5 algorithm, YOLOV5-FCC algorithm can improve mAP50 and mAP50-95 by 2.0% and 4.2%, respectively. The effectiveness of YOLOv5-FCC algorithm was also verified on VisDrone dataset, and the results showed that the established system can realize automatic detection of road targets.

Keywords

unmanned aerial vehicle(UAV) / road target detection / YOLOv5 / loss function / convolutional block attention module(CBAM)

Cite this article

Download citation ▾
Yi ZHANG, Ronggui MA, Chen LIANG. Road target detection algorithm based on improved YOLOv5 in UAV images. Journal of Measurement Science and Instrumentation, 2024, 15(1): 128-139 DOI:10.62756/jmsi.1674-8042.2024013

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

FAN Z H. Research and application of target detection based on YOLO. Chengdu: Sichuan University, 2021.

[2]

DALAL N, TRIGGS B. Histograms of oriented gradients for human detection//Computer Society Conference on Computer Vision and Pattern Recognition. June 20-25, 2005, San Diego, CA, USA. Piscataway, N.J.: IEEE, 2005: 886-893.

[3]

GAO X, WU Y, YANG K, et al. Vehicle bottom anomaly detection algorithm based on SIFT. Optik: International Journal for Light and Electron Optics, 2015, 126(23): 3562-3566.

[4]

HENG C K, YOKOMITSU S, MATSUMOTO Y, et al. Shrink boost for selecting multi-lbp histogram features in object detection// IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 16-21, 2012, Providence, RI, USA. Piscataway, N.J.: IEEE, 2012: 3250-3257.

[5]

ARDIANTO S, CHEN C J, HANG H M. Real-time traffic sign recognition using color segmentation and SVM// International Conference on Systems, Signals and Image Processing, May 22-24, 2017, Poznan, Poland, USA. Piscataway, N.J.: IEEE, 2017: 1-5.

[6]

ELLAHYANI A, ANSARI M E, JAAFARI I E. Traffic sign detection and recognition based on random forests. Applied Soft Computing, 2016, 46: 805-815.

[7]

GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation//Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. Piscataway, N.J.: IEEE, 2014: 580-587.

[8]

GIRSHICK R. Fast R-CNN//IEEE International Conference on Computer Vision, December 7-13, 2015, Santiago, Chile. Piscataway, N.J.: IEEE, 2015: 1440-1448.

[9]

REN S, HE K, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems, 2015, 28: 91-99.

[10]

REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection//Conference on Computer Vision and Pattern Recognition, June 27-30, 2016, Las Vegas, NV, USA. Piscataway, N.J.: IEEE, 2016: 779-788.

[11]

REDMON J, FARHADI A. Yolov3: An incremental improvement. 2018-04-08[2023-01-04]. 2018:1-6.

[12]

BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOv4: Optimal speed and accuracy of object detection. 2023-01-04].

[13]

ZHU X, LYU S, WANG X, et al. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios//International Conference on Computer Vision. October 10-17, 2021, Montreal, Canada. Piscataway. N.J.: IEEE, 2021: 2778-2788.

[14]

ZHAO L, WANG X, ZHANG Y. Vehicle object detection based on YOLOv5s fusion SENet. Journal of Graphics, 2022, 43(5): 776-782.

[15]

ZHANG S, WANG H, RAN X K. A lightweight traffic sign detection method based on YOLOv5. Electronic Measurement Technology, 2022, 45(8): 129-135

[16]

WANG C Y, LIAO H Y M, WU Y H, et al. CSPNet: A new backbone that can enhance learning capability of CNN//Conference on Computer Vision and Pattern Recognition, June 13-19, 2020, Seattle, WA, USA. Piscataway, N.J.: IEEE, 2020: 390-391.

[17]

HE K, ZHANG X, REN S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916.

[18]

LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection//Conference on Computer Vision and pattern recognition, July 21-26, 2017, Honolulu, HI, USA. Piscataway, N.J.: IEEE, 2017: 2117-2125.

[19]

LIU S, QI L, QIN H, et al. Path aggregation network for instance segmentation//Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. Piscataway, N.J.: IEEE, 2018: 8759-8768.

[20]

HU J, SHEN L, SUN G. Squeeze-and-excitation networks//Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. Piscataway, N.J.: IEEE, 2018: 7132-7141

[21]

WANG Q, WU B, ZHU P, et al. ECA-Net: Efficient channel attention for deep convolutional neural networks //Conference on Computer Vision and Pattern Recognition. June 13-19, 2020, Seattle, WA, USA. Piscataway, N.J.: IEEE, 2020: 11534-11542.

[22]

WOO S, PARK J, LEE J Y, et al. Cbam: Convolutional block attention module//European Conference on Computer Vision. September 8-14, 2018, Munich, Germany. Berlin: Springer, 2018: 3-19.

[23]

WANG P, HUANG H, WANG M. Complex road target detection algorithm based on improved YOLOv5. Computer Engineering and Applications. 2022, 58(17):81-92.

PDF (7705KB)

46

Accesses

0

Citation

Detail

Sections
Recommended

/