Attention-based efficient robot grasp detection network

Xiaofei QIN; Wenkai HU; Chen XIAO; Changxiang HE; Songwen PEI; Xuedian ZHANG

doi:10.1631/FITEE.2200502

PDF(18443 KB)

Front. Inform. Technol. Electron. Eng ›› 2023, Vol. 24 ›› Issue (10) : 1430-1444. DOI: 10.1631/FITEE.2200502

Orginal Article

Attention-based efficient robot grasp detection network

Xiaofei QIN¹ ,
Wenkai HU¹ ,
Chen XIAO² ,
Changxiang HE² ,
Songwen PEI¹^,³^,⁴ ,
Xuedian ZHANG¹^,³^,⁴^,⁵

Author information +

History +

Abstract

To balance the inference speed and detection accuracy of a grasp detection algorithm, which are both important for robot grasping tasks, we propose an encoder–decoder structured pixel-level grasp detection neural network named the attention-based efficient robot grasp detection network (AE-GDN). Three spatial attention modules are introduced in the encoder stages to enhance the detailed information, and three channel attention modules are introduced in the decoder stages to extract more semantic information. Several lightweight and efficient DenseBlocks are used to connect the encoder and decoder paths to improve the feature modeling capability of AE-GDN. A high intersection over union (IoU) value between the predicted grasp rectangle and the ground truth does not necessarily mean a high-quality grasp configuration, but might cause a collision. This is because traditional IoU loss calculation methods treat the center part of the predicted rectangle as having the same importance as the area around the grippers. We design a new IoU loss calculation method based on an hourglass box matching mechanism, which will create good correspondence between high IoUs and high-quality grasp configurations. AE-GDN achieves the accuracy of 98.9% and 96.6% on the Cornell and Jacquard datasets, respectively. The inference speed reaches 43.5 frames per second with only about 1.2 × 10⁶ parameters. The proposed AE-GDN has also been deployed on a practical robotic arm grasping system and performs grasping well. Codes are available at https://github.com/robvincen/robot_gradet.

Keywords

Robot grasp detection / Attention mechanism / Encoder-decoder / Neural network

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Xiaofei QIN, Wenkai HU, Chen XIAO, Changxiang HE, Songwen PEI, Xuedian ZHANG. Attention-based efficient robot grasp detection network. Front. Inform. Technol. Electron. Eng, 2023, 24(10): 1430‒1444 https://doi.org/10.1631/FITEE.2200502