Multi-directional attention: a lightweight attention module for slender structures

Dong Liu , Yaoren Zhang , Yu Ren , Dehua Pan , Xiang He , Ming Cong , Guanghai Yu

Intelligence & Robotics ›› 2025, Vol. 5 ›› Issue (4) : 827 -43.

PDF
Intelligence & Robotics ›› 2025, Vol. 5 ›› Issue (4) :827 -43. DOI: 10.20517/ir.2025.42
Research Article

Multi-directional attention: a lightweight attention module for slender structures

Author information +
History +
PDF

Abstract

The segmentation of slender structures in images faces challenges of discontinuous segmentation and insufficient recognition. These slender structures, such as arteries and veins, are particularly important in 3D medical images that are sensitive to the computational complexity of segmentation networks. Therefore, in order to balance the computational complexity of the network and adaptive perception of slender structures, this paper proposes a multi-directional attention module for slender structures, which is a lightweight attention module that can be inserted into the encoder or decoder unit. At the same time, we propose a contour loss function to address the class imbalance phenomenon that may arise in the joint segmentation task of slender and ordinary structures. This function improves the balance by converting traditional class mask labels into contour mask labels. The effectiveness of our proposed module has been validated through training on segmentation tasks on 2D and 3D images.

Keywords

Image segmentation / slender structures / medical image / convolutional neural network

Cite this article

Download citation ▾
Dong Liu, Yaoren Zhang, Yu Ren, Dehua Pan, Xiang He, Ming Cong, Guanghai Yu. Multi-directional attention: a lightweight attention module for slender structures. Intelligence & Robotics, 2025, 5(4): 827-43 DOI:10.20517/ir.2025.42

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Shao P,Yin C.Laparoscopic partial nephrectomy with segmental renal artery clamping: technique and clinical outcomes.Eur Urol2011;59:849-55

[2]

Yu F. , Koltun V. Multi-scale context aggregation by dilated convolutions. arXiv 2015, arXiv:1511.07122. https://arxiv.org/abs/1511.07122. (accessed 10 Sep 2025)

[3]

Dai J,Xiong Y.Deformable convolutional networks. In 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy. October 22-29, 2017. IEEE; 2017. pp. 764-73.

[4]

Qi Y,Qi X,Yang G.Dynamic snake convolution based on topological geometric constraints for tubular structure segmentation. In 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France. October 01-06, 2023. IEEE; 2023. pp 6047-56.

[5]

Zou S,Luo H,Qian Y.AF-Net: all-scale feature fusion network for road extraction from remote sensing images. In 2021 Digital Image Computing: Techniques and Applications (DICTA), Gold Coast, Australia. Novermber 29 - December 01, 2021. IEEE; 2021. pp 66-73.

[6]

Xian Y,Chen X.DCFU-Net: rethinking an effective attention and convolutional architecture for retinal vessel segmentation.Int J Imaging Syst Tech2025;35:e70003

[7]

Yang C,Chi D.Contour attention network for cerebrovascular segmentation from TOF-MRA volumetric images.Med Phys2024;51:2020-31

[8]

Jalali Y,Rezvani M.VGA-Net: vessel graph based attentional U-Net for retinal vessel segmentation.IET Image Processing2024;18:2191-213

[9]

Shit S,Sekuboyina A.clDice - a novel topology-preserving loss function for tubular structure segmentation. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR), Nashiville, USA. June 20-25, 2021. IEEE; 2021. pp. 16560-9.

[10]

Wang Y,Liu F.Deep distance transform for tubular structure segmentation in CT scans. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA. June 13-19, 2020. IEEE; 2020. pp. 3832-41.

[11]

Shang Z,Huang H.DCNet: a lightweight retinal vessel segmentation network.Digit Signal Process2024;153:104651

[12]

Pang Y,Huang T.Slim UNETR: scale hybrid transformers to efficient 3D medical image segmentation under limited computational resources.IEEE Trans Med Imaging2024;43:994-1005

[13]

Ronneberger O,Brox T.U-Net: convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, Munich, Germany. Springer, Cham; 2015. pp. 234-41.

[14]

Zhang Z,Wang Y.Road extraction by deep residual U-Net.IEEE Geosci Remote Sensing Lett2018;15:749-53

[15]

Zhou Z,Tajbakhsh N.UNet++: a nested U-Net architecture for medical image segmentation. In: Stoyanov D, Taylor Z, Carneiro G, et al. editors. Deep learning in medical image analysis and multimodal learning for clinical decision support. Cham: Springer International Publishing; 2018. pp. 3-11.

[16]

Huang H,Tong R.Unet 3+: a full-scale connected UNet for medical image segmentation. In ICASSP 2020 - 2020 IEEE international conference on acoustics, speech and signal processing (ICASSP), Barcelona, Spain. May 04-08, 2020. IEEE; 2020. pp. 1055-9.

[17]

Li J,Taha A,Zhao T.Segmentation of renal structures for image-guided surgery. In: Frangi AF, Schnabel JA, Davatzikos C, Alberola-López C, Fichtinger G, editors. Medical image computing and computer assisted intervention - MICCAI 2018. Cham: Springer International Publishing; 2018. pp. 454-62.

[18]

Dosovitskiy A,Kolesnikov A. An image is worth 16 × 16 words: transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. https://arxiv. org/abs/2010.11929. (accessed 10 Sep 2025)

[19]

Liu Z,Cao Y.Swin transformer: hierarchical vision transformer using shifted windows. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, Canada. October 10-17, 2021. IEEE; 2021. pp. 10012-22.

[20]

Chollet F.Xception: deep learning with depthwise separable convolutions. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, USA. July 21-26, 2017. IEEE; 2017. pp. 1251-8.

[21]

Ho J,Weissenborn D. Axial attention in multidimensional transformers. arXiv 2019, arXiv:1912.12180. https://arxiv.org/abs/1912.12180. (accessed 10 Sep 2025)

[22]

He Y,Yang J.Meta grayscale adaptive network for 3D integrated renal structures segmentation.Med Image Anal2021;71:102055

[23]

He Y,Yang J.Dense biased networks with deep priori anatomy and hard region adaptation: semi-supervised learning for fine renal artery segmentation.Med Image Anal2020;63:101722

[24]

Shao P,Li P.Precise segmental renal artery clamping under the guidance of dual-source computed tomography angiography during laparoscopic partial nephrectomy.Eur Urol2012;62:1001-8

[25]

Grand Challenge. Kidney Parsing Challenge 2022. https://kipa22.grand-challenge.org/. (accessed 2025-09-10)

[26]

Staal J,Niemeijer M.Ridge-based vessel segmentation in color images of the retina.IEEE Trans Med Imaging2004;23:501-9

[27]

Hoover A,Goldbaum M.Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response.IEEE Trans Med Imaging2000;19:203-10

[28]

Fraz MM,Hoppe A.An ensemble classification-based approach applied to retinal blood vessel segmentation.IEEE Trans Biomed Eng2012;59:2538-48

[29]

Kaggle. Satellite-road-segmentation-dataset. https://www.kaggle.com/datasets/timothlaborie/roadsegmentation-boston-losangeles/. (accessed 2025-09-10)

AI Summary AI Mindmap
PDF

293

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/