Detection of loop closure in visual SLAM: a stacked assorted auto-encoder based approach
Yuan Luo, Yuting Xiao, Yi Zhang, Nianwen Zeng
Optoelectronics Letters, 2021, Vol. 17, Issue 6: 354-360.
Mainstream loop closure detection methods in visual simultaneous localization and mapping (SLAM) are based on bag-of-words (BoW). However, traditional BoW-based approaches are strongly affected by changes in scene appearance, which leads to poor robustness and low precision. To improve the precision and robustness of loop closure detection, a novel approach based on a stacked assorted auto-encoder (SAAE) is proposed. A traditional stacked auto-encoder is made up of multiple layers of the same auto-encoder. Compared with the visual BoW model, it extracts the features of the scene image better, but its output feature dimension is high. The proposed SAAE is instead composed of multiple layers of denoising, convolutional and sparse auto-encoders: the denoising auto-encoder improves the robustness of image features, the convolutional auto-encoder preserves the spatial information of the image, and the sparse auto-encoder reduces the dimensionality of image features. The SAAE can extract low- to high-dimensional features of the scene image while preserving the spatial local characteristics of the image, which makes the output features more robust. The performance of the SAAE is evaluated in a comparison study using data from the New College and City Centre datasets. The proposed method can effectively improve the precision and robustness of loop closure detection in visual SLAM.
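The three ingredients named in the abstract can be illustrated with a minimal, untrained forward pass. This is only a sketch under stated assumptions, not the authors' implementation: the layer order (denoising, then convolutional, then sparse), the masking-noise corruption, the sigmoid activations, the KL-divergence sparsity penalty, and all sizes and names (`mask_noise`, `conv2d_valid`, `kl_sparsity`, `encode`) are hypothetical choices made for illustration. Training and the loop closure decision itself are omitted.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def mask_noise(image, drop_prob, rng):
    """Denoising ingredient: zero each pixel with probability drop_prob."""
    return [[0.0 if rng.random() < drop_prob else v for v in row] for row in image]


def conv2d_valid(image, kernel):
    """Convolutional ingredient: 'valid' 2-D cross-correlation plus sigmoid.

    The output stays a 2-D map, so spatial layout is preserved.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = len(image) - kh + 1, len(image[0]) - kw + 1
    return [
        [
            sigmoid(sum(image[i + a][j + b] * kernel[a][b]
                        for a in range(kh) for b in range(kw)))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def dense_sigmoid(vec, weights):
    """Sparse ingredient (encoder part): project to a shorter code."""
    return [sigmoid(sum(w * v for w, v in zip(row, vec))) for row in weights]


def kl_sparsity(activations, rho=0.05):
    """Sparse ingredient (penalty part): sum of KL(rho || rho_hat) per unit."""
    total = 0.0
    for rho_hat in activations:
        rho_hat = min(max(rho_hat, 1e-8), 1.0 - 1e-8)  # avoid log(0)
        total += (rho * math.log(rho / rho_hat)
                  + (1.0 - rho) * math.log((1.0 - rho) / (1.0 - rho_hat)))
    return total


def encode(image, kernel, weights, drop_prob, rng):
    """Assumed order: corrupt -> convolve -> flatten -> low-dimensional code."""
    noisy = mask_noise(image, drop_prob, rng)
    feature_map = conv2d_valid(noisy, kernel)
    flat = [v for row in feature_map for v in row]
    return dense_sigmoid(flat, weights)


# Toy data: a random 6x6 "image", one 3x3 kernel, and a 16 -> 4 projection.
rng = random.Random(42)
image = [[rng.random() for _ in range(6)] for _ in range(6)]
kernel = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
weights = [[rng.uniform(-1, 1) for _ in range(16)] for _ in range(4)]

descriptor = encode(image, kernel, weights, drop_prob=0.2, rng=rng)
penalty = kl_sparsity(descriptor)
print(len(descriptor), penalty)
```

In this toy setup the 36-pixel input is reduced to a 4-value descriptor, which mirrors the dimensionality reduction the abstract attributes to the sparse auto-encoder. In a trained model, `penalty` would be added to the reconstruction loss so that most code units stay close to the target activation `rho`.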