An improved double deep Q-network algorithm for disassembly line balancing problems considering worker fatigue
Ruohong Shi , Xiaowei Xu , Zhongyuan Yang , Shuo Shi
Intelligent Marine Technology and Systems ›› 2026, Vol. 4 ›› Issue (1) : 10
The rapid obsolescence of electronic products highlights the critical need for efficient resource recovery through disassembly to support sustainable development. However, conventional disassembly methods often fail to handle complex disassembly sequences and dynamic operational constraints. To address the challenge of disassembly line balancing under worker fatigue, this study proposes an attention-based double deep Transformer Q-network (DDTQN) for minimizing disassembly time. It develops a disassembly time optimization model that incorporates fatigue-induced efficiency decay to enable the simulation of realistic operational conditions. By integrating an attention mechanism into the DDQ framework, the proposed approach enhances the capacity of the model to capture intricate task dependencies, thereby improving state representation, exploration efficiency, and long-term decision-making. Experimental results across three disassembly cases indicate that DDTQN reduces the average disassembly time by 19.37% compared with benchmark algorithms—including DDQN, deep Q-network (DQN), and advantage actor-critic. The successful application of DDTQN to marine equipment disassembly demonstrates its broad applicability and effectiveness, offering a robust solution for both general disassembly lines and specialized contexts such as ship recycling.
Double deep Transformer Q-learning algorithm / Deep reinforcement learning / Fatigue index / Disassembly line balance problem / Marine equipment recycling
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
Baldé CP, Kuehr R, Yamamoto T, McDonald R, D’Angelo E, Althaf S et al (2024) The global e-waste monitor 2024. United Nations University (UNU), International Telecommunication Union (ITU) & International Solid Waste Association (ISWA), Geneva/Bonn, pp 1–147. https://www.itu.int/en/ITU-D/Environment/Pages/Publications/The-Global-E-waste-Monitor-2024.aspx |
| [5] |
Bi ZL, Guo XW, Wang JC, Qin SJ, Qi L, Zhao J (2022) A Q-learning-based selective disassembly sequence planning method. In: 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, pp 3216–3221. https://doi.org/10.1109/SMC53654.2022.9945073 |
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
Dong CS, Liu PS, Guo XW, Qi L, Qin SJ, Xu GD (2021) Multi-objective ant lion optimizer for stochastic robotic disassembly line balancing problem subject to resource constraints. In: 2nd International Conference on Computer Vision and Data Mining (ICCVDM 2021). IOP Publishing, pp 1–7. https://doi.org/10.1088/1742-6596/2024/1/012014 |
| [10] |
Güler E, Kalayci CB, Ilgin MA, Özceylan E, Güngör A (2024) Advances in partial disassembly line balancing: a state-of-the-art review. Comput Ind Eng 188:109898. https://doi.org/10.1016/j.cie.2024.109898 |
| [11] |
Guo XW, Liu SX, Zhou MC, Tian GD (2016) Disassembly sequence optimization for large-scale products with multiresource constraints using scatter search and Petri nets. IEEE Trans Cybern 46(11):2435–2446. https://doi.org/10.1109/TCYB.2015.2478486 |
| [12] |
Guo XW, Wei TT, Wang JC, Liu SX, Qin SJ, Qi L (2023) Multiobjective U-shaped disassembly line balancing problem considering human fatigue index and an efficient solution. IEEE Trans Comput Soc Syst 10(4):2061–2073. https://doi.org/10.1109/TCSS.2022.3217101 |
| [13] |
Guo XW, Zhang ZW, Qi L, Liu SX, Tang Y, Zhao ZY (2022) Stochastic hybrid discrete grey wolf optimizer for multi-objective disassembly sequencing and line balancing planning in disassembling multiple products. IEEE Trans Autom Sci Eng 19(3):1744–1756. https://doi.org/10.1109/TASE.2021.3133601 |
| [14] |
Igarashi K, Inoue M, Yamada T (2014) 2-stage optimal design and analysis for disassembly system with environmental and economic parts selection using the recyclability evaluation method. Ind Eng Manag Syst 13(1):52–66. https://doi.org/10.7232/iems.2014.13.1.052 |
| [15] |
Iqbal KMJ, Heidegger P (2013) Pakistan ship-breaking outlook. Sustainable Development Policy Institute, and NGO Ship-Breaking Platform: Brussels, Belgium. https://scholar.google.com.hk/scholar?hl=zh-CN&as_sdt=0%2C5&q=Iqbal+KMJ%2C+Heidegger+P+%282013%29+Pakistan+ship-breaking+outlook.+Sustainable+Development+Policy+Institute%2C+and+NGO+Ship-Breaking+Platform%3A+Brussels%2C+Belgium.+&btnG= |
| [16] |
Jaber MY, Givi ZS, Neumann WP (2013) Incorporating human fatigue and recovery into the learning-forgetting process. Appl Math Model 37(12–13):7287–7299. https://doi.org/10.1016/j.apm.2013.02.028 |
| [17] |
Kalayci CB, Gupta SM (2013) Artificial bee colony algorithm for solving sequence-dependent disassembly line balancing problem. Expert Syst Appl 40(18):7231–7241. https://doi.org/10.1016/j.eswa.2013.06.067 |
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
Liu YZ, Zhou MC, Guo XW (2022) An improved Q-learning algorithm for human-robot collaboration two-sided disassembly line balancing problems. In: 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, pp 568–573. https://doi.org/10.1109/SMC53654.2022.9945263 |
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Sibal P (2001) A database for dismantling of obsolete vessels. West Virginia University. https://scholar.google.com.hk/scholar?hl=zh-CN&as_sdt=0%2C5&q=Sibal+P+%282001%29+A+database+for+dismantling+of+obsolete+vessels.+West+Virginia+University&btnG= |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
Wiering MA, van Otterlo M (2012) Reinforcement learning. Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-27645-3 |
| [34] |
|
| [35] |
Yu ZS, Zhang XH (2023) Actor-critic alignment for offline-to-online reinforcement learning. In: Proceedings of the 40th International Conference on Machine Learning. PMLR, pp 40452–40474. https://proceedings.mlr.press/v202/yu23k.html |
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
The Author(s)
/
| 〈 |
|
〉 |