Real-time K-TIG welding penetration prediction on embedded system using a segmentation-LSTM model
Yong-Hua Shi, Zi-Shun Wang, Xi-Yin Chen, Yan-Xin Cui, Tao Xu, Jin-Yi Wang
Advances in Manufacturing, 2023, Vol. 11, Issue 3: 444–461.
Keyhole tungsten inert gas (K-TIG) welding can achieve single-sided welding with double-sided forming and is widely used for welding medium and thick plates. To improve the accuracy of automatic weld identification and weld penetration prediction when robots weld large workpieces, this paper proposes a two-stage model, called the segmentation-LSTM model, that monitors the K-TIG welding penetration state in real time on an embedded system. The proposed system first extracts nine geometric features of the weld pool with a segmentation network and then extracts the weld gap with a traditional algorithm. The resulting 10-dimensional feature vector is fed into the LSTM model to predict the penetration state, which is one of four classes: under penetration, partial penetration, good penetration, and over penetration. The recognition accuracy of the proposed system reaches 95.2%. Two difficulties are addressed within the system. To ease data labeling, an improved LabelMe annotation tool with live-wire annotation is proposed. To raise segmentation accuracy, a novel loss function, called focal dice loss, is proposed; it enables the network to achieve 0.933 mIoU on the testing set. Finally, an improved slimming strategy compresses the segmentation network so that it runs in real time on the embedded system (RK3399pro).
Keyhole tungsten inert gas (K-TIG) welding / Penetration state prediction / Segmentation-LSTM model / Embedded system / Focal dice loss / Improved LabelMe
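The abstract reports segmentation quality as mIoU (mean intersection over union). As a reading aid, the following is a minimal sketch of how that metric is commonly computed for two label maps. The function name `mean_iou`, the toy label maps, and the choice to skip classes that appear in neither map are assumptions made for illustration, not details taken from the paper.

```python
from typing import Sequence


def mean_iou(
    pred: Sequence[Sequence[int]],
    truth: Sequence[Sequence[int]],
    num_classes: int,
) -> float:
    """Mean intersection-over-union across classes for two label maps.

    Assumption: classes absent from both maps are left out of the mean.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p == t:
                # Pixel belongs to class p in both maps.
                intersection[p] += 1
                union[p] += 1
            else:
                # Pixel counts toward the union of both classes involved.
                union[p] += 1
                union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    if not ious:
        raise ValueError("no class is present in either label map")
    return sum(ious) / len(ious)


# Toy 2x3 label maps (hypothetical): 0 = background, 1 = weld pool.
predicted = [[0, 0, 1], [0, 1, 1]]
annotated = [[0, 1, 1], [0, 1, 1]]
score = mean_iou(predicted, annotated, num_classes=2)
```

Here one pixel is mislabeled, so the per-class IoU values are 2/3 and 3/4 and `score` is their mean, 17/24.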