A prohibited items identification approach based on semantic segmentation
Shao-qing Yao, Zhi-gang Su, Jin-feng Yang, Hai-gang Zhang
Optoelectronics Letters ›› 2021, Vol. 17 ›› Issue (4) : 247-251.
A prohibited items identification approach based on semantic segmentation
Deep learning (DL) based semantic segmentation methods can extract object information including category, location and shape. In this paper, the identification of prohibited items is regarded as a task of semantic segmentation, and proposes a universal model with automatic identification of prohibited items. This model has two improvements based on the general semantic segmentation network. Firstly, the N-type encoding structure is applied to enlarge the receptive field of the network aiming at reducing the misclassification. Secondly, consider the lack of surface texture in X-ray security images. Inspired by feature reuse in Densenet, shallow semantic information is reused to improve the segmentation accuracy. With the use of this model, when using input images of size 512×512, we could achieve 0.783 mean intersection over union (mIoU) for a seven-class object recognition problem.
[1] |
Caijing Miao, Lingxi Xie, Fang Wan, Chi Su, Hongye Liu, Jianbin Jiao and Qixiang Ye, SIXray: A Large-Scale Security Inspection X-Ray Benchmark for Prohibited Item Discovery in Overlapping Images, CVPR, 2119 (2019).
|
[2] |
|
[3] |
Long J, Shelhamer E and Darrell T, Fully Convolutional Networks for Semantic Segmentation, CVPR, 3431 (2015).
|
[4] |
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva and Antonio Torralba, Learning Deep Features for Discriminative Localization, CVPR, 2921 (2016).
|
[5] |
Ronneberger O, Fischer P and Brox T, U-net: Convolutional Networks for Biomedical Image Segmentation, Medical Image Computing and Computer Assisted Intervention, 234 (2015).
|
[6] |
Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang and Jiaya Jia, Pyramid Scene Parsing Network, CVPR, 2881 (2017).
|
[7] |
Gao Huang, Zhuang Liu, Laurens van der Maaten and Kilian Q, Weinberger: Densely Connected Convolutional Networks, CVPR, 4700 (2017).
|
[8] |
|
[9] |
Yu F, Koltun V and Funkhouser T, Dilated Residual Networks, CVPR, 472 (2017).
|
[10] |
Liang-Chieh Chen, George Papandreou, Florian Schroff and Hartwig Adam, Rethinking Atrous Convolution for Semantic Image Segmentation, arXiv preprint arXiv 1706.05587, 2017.
|
[11] |
Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff and Hartwig Adam, Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation, ECCV, 801 (2018).
|
[12] |
Yu Wang, Quan Zhou, Jia Liu, Jian Xiong, Guangwei Gao, Xiaofu Wu and Longin Jan Latecki, LEDNet: a Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation, arXiv preprint arXiv, 1905.02423 (2019).
|
[13] |
Yu Wang, Quan Zhou and Xiaofu Wu, ESNet: An Efficient Symmetric Network for Real-Time Semantic Segmentation, PRCV, 41 (2019).
|
[14] |
Wei Y and Liu X, Dangerous Goods Detection Based on Transfer Learning in X-Ray Images, Neural Computing and Applications, 1 (2019).
|
[15] |
Xu C, Han N and Li H, A Dangerous Goods Detection Approach Based on YOLOv3, Computer Science and Artificial Intelligence, 600 (2018).
|
[16] |
Xu M, Zhang H and Yang J, Prohibited Item Detection in Airport X-Ray Security Images via Attention Mechanism Based CNN, PRCV, 429 (2018).
|
[17] |
Jiuyuan An, Haigang Zhang, Yue Zhu and Jinfeng Yang, Semantic Segmentation for Prohibited Items in Baggage Inspection, International Conference on Intelligent Science and Big Data Engineering, 495 (2019).
|
/
〈 |
|
〉 |