Multi-scale detector optimized for small target

Yongchang Zhu, Sen Yang, Jigang Tong, Zenghui Wang

Optoelectronics Letters ›› 2024, Vol. 20 ›› Issue (4) : 243-248.

Optoelectronics Letters All Journals
Optoelectronics Letters ›› 2024, Vol. 20 ›› Issue (4) : 243-248. DOI: 10.1007/s11801-024-3126-1
Article

Multi-scale detector optimized for small target

Author information +
History +

Abstract

The effectiveness of deep learning networks in detecting small objects is limited, thereby posing challenges in addressing practical object detection tasks. In this research, we propose a small object detection model that operates at multiple scales. The model incorporates a multi-level bidirectional pyramid structure, which integrates deep and shallow networks to simultaneously preserve intricate local details and augment global features. Moreover, a dedicated multi-scale detection head is integrated into the model, specifically designed to capture crucial information pertaining to small objects. Through comprehensive experimentation, we have achieved promising results, wherein our proposed model exhibits a mean average precision (mAP) that surpasses that of the well-established you only look once version 7 (YOLOv7) model by 1.1%. These findings validate the improved performance of our model in both conventional and small object detection scenarios.

Cite this article

Download citation ▾
Yongchang Zhu, Sen Yang, Jigang Tong, Zenghui Wang. Multi-scale detector optimized for small target. Optoelectronics Letters, 2024, 20(4): 243‒248 https://doi.org/10.1007/s11801-024-3126-1

References

[1]
LiaoY R, WangH N, LinC B, et al.. Research progress of optical remote sensing image target detection based on deep learning[J]. Journal on communications, 2022, 43(5):190-203
[2]
ZhangT, LiZ, SunZ, et al.. A fully convolutional anchor-free object detector[J]. The visual computer, 2023, 39(2):569-580
CrossRef Google scholar
[3]
MohammadkarimiM, MehrabiM, ArdakaniM, et al.. Deep learning-based sphere decoding[J]. IEEE transactions on wireless communications, 2019, 18(9):4368-4378
CrossRef Google scholar
[4]
LiZ, GuoQ, SunB, et al.. Small object detection methods in complex background: an overview[J]. International journal of pattern recognition and artificial intelligence, 2023, 37(2):2350002
CrossRef Google scholar
[5]
LiR, HuJ, LiS, et al.. Blind detection of communication signals based on improved YOLO3[C], 2021, New York, IEEE: 424-429
[6]
VaraP N R S, D’souzakevinB, BhargavavijayK. A downscaled faster-RCNN framework for signal detection and time-frequency localization in wideband RF systems[J]. IEEE transactions on wireless communications, 2020, 19(7):4847-4862
CrossRef Google scholar
[7]
WanY, LiaoZ, LiuJ, et al.. Small object detection leveraging density-aware scale adaptation[J]. The photogrammetric record, 2023, 38(182):160-175
CrossRef Google scholar
[8]
QinH, WuY, DongF, et al.. Dense sampling and detail enhancement network: improved small object detection based on dense sampling and detail enhancement[J]. IET computer vision, 2022, 16(4):307-316
CrossRef Google scholar
[9]
XiaoZ H, DongE Z, TongJ G, et al.. Light weight object detector based on composite attention residual network and boundary location loss[J]. Neurocomputing, 2022, 494: 132-147
CrossRef Google scholar
[10]
ZhangS F, WangQ, ZhuT, et al.. Detection and classification of small traffic signs based on cascade network[J]. Chinese journal of electronics, 2021, 30(4):727-735
[11]
ChenS, LiZ, TangZ. Relation R-CNN: a graph based relation-aware network for object detection[J]. IEEE signal processing letters, 2020, 27: 1680-1684
CrossRef Google scholar
[12]
XuD, GuanJ, FengP, et al.. Association loss for visual object detection[J]. IEEE signal processing letters, 2020, 27: 1435-1439
CrossRef Google scholar
[13]
RedmonJ, DivvalaS, GirshickR, et al.. You only look once: unified, real-time object detection[C], 2016, New York, IEEE: 779-788
[14]
RedmonJ, FarhadiA. YOLO9000: better, faster, stronger[C], 2017, New York, IEEE: 7263-7271
[15]
REDMON J, FARHADI A. YOLOV3: an incremental improvement[EB/OL]. (2018-04-08) [2023-09-05]. https://arxiv.org/abs/1804.02767.
[16]
BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOV4: optimal speed and accuracy of object detection[EB/OL]. (2020-04-23) [2023-09-05]. https://arxiv.org/abs/2004.10934.
[17]
JIN H, SONG Q, HU X. Auto-Keras: efficient neural architecture search with network morphism[EB/OL]. (2018-06-27) [2023-09-05]. https://arxiv.org/abs/1806.10282v2.
[18]
TanM, PangR, LeQ V. EfficientDet: scalable and efficient object detection[C], 2020, New York, IEEE: 10778-10787
[19]
LiF, GaoD, YangY, et al.. Small target deep convolution recognition algorithm based on improved YOLOv4[J]. International journal of machine learning and cybernetics, 2023, 14(2):387-394
CrossRef Google scholar
[20]
BosquetB, CoresD, SeidenariL, et al.. A full data augmentation pipeline for small object detection based on generative adversarial networks[J]. Pattern recognition: the journal of the pattern recognition society, 2023, 133: 108998-109010
CrossRef Google scholar
[21]
YangZ, YuH, FengM, et al.. Small object augmentation of urban scenes for real-time semantic segmentation[J]. IEEE transactions on image processing, 2020, 29: 5175-5190
CrossRef Google scholar
[22]
LeeG, HongS, ChoD. Self-supervised feature enhancement networks for small object detection in noisy images[J]. IEEE signal processing letters, 2021, 28: 1026-1030
CrossRef Google scholar
[23]
ZhangH, DuQ, QiQ, et al.. A recursive attention-enhanced bidirectional feature pyramid network for small object detection[J]. Multimedia tools and applications, 2023, 82(9):13999-14018
CrossRef Google scholar

40

Accesses

0

Citations

Detail

Sections
Recommended

/