Fusion network for small target detection based on YOLO and attention mechanism
Caie Xu, Zhe Dong, Shengyun Zhong, Yijiang Chen, Sishun Pan, Mingyang Wu
Fusion network for small target detection based on YOLO and attention mechanism
Target detection is an important task in computer vision research, and such an anomaly detection and the topic of small target detection task is more concerned. However, there are still some problems in this kind of researches, such as small target detection in complex environments is susceptible to background interference and poor detection results. To solve these issues, this study proposes a method which introduces the attention mechanism into the you only look once (YOLO) network. In addition, the amateur-produced mask dataset was created and experiments were conducted. The results showed that the detection effect of the proposed mothed is much better.
[[1]] |
KRIZHEVSKY A, SUTSKEVER I, HINTON G E. Imagenet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017: 84–90.
|
[[2]] |
REDMON J, FARHADI A. YOLOv3: an incremental improvement[EB/OL]. (2018-04-08) [2023-06-24]. https://arxiv.org/abs/1804.02767.
|
[[3]] |
|
[[4]] |
BOCHKOVSKIY A, WANG C Y, LIAO H Y. YOLOv4: optimal speed and accuracy of object detection[EB/OL]. (2020-04-23) [2023-06-24]. https://arxiv.org/abs/2004.10934.
|
[[5]] |
|
[[6]] |
|
[[7]] |
|
[[8]] |
|
[[9]] |
|
[[10]] |
|
[[11]] |
|
[[12]] |
|
[[13]] |
DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16×16 words: transformers for image recognition at scale[EB/OL]. (2020-10-22) [2023-06-24]. https://arxiv.org/abs/2010.11929v1.
|
[[14]] |
|
[[15]] |
|
[[16]] |
|
[[17]] |
VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[J]. Neural information processing systems, neural information processing systems, 2017: 30.
|
[[18]] |
RAMACHANDRAN P, ZOPH B, LE Q. Searching for activation functions[EB/OL]. (2017-10-16) [2023-06-24]. https://arxiv.org/abs/1710.05941v2.
|
/
〈 |
|
〉 |