Multi-satellite cooperative scheduling for time-sensitive space targets tracking based on double deep Q-network
Shunyi CAO , Xiaolu LIU , Lei HE , Jie CHUN
Eng. Manag ››
Tracking time-sensitive space targets, characterized by high uncertainty and highly dynamic motion, requires multi-satellite coordination while accounting for the risk of target loss. This study proposes a mathematical model and a Time-Sensitive Space Multi-Target Observation Scheduling (TSSMTOS) algorithm that considers both tracking rewards and target loss scenarios. First, we establish a generalized real-time multi-satellite scheduling framework for tracking these unpredictable, rapidly evolving space targets. Subsequently, we introduce a Double Deep Q-network for Variable-Number Targets (DDQN-VNT). This approach enables the real-time allocation of dual satellites to each target, guided by heuristic rules, effectively addressing dynamic observation requirements. Experimental results demonstrate that DDQN-VNT outperforms traditional rule-based algorithms, the Parallel Dual Adaptive Genetic Algorithm (PDA-GA), and Adaptive Large Neighborhood Search (ALNS) in complex scenarios, exhibiting superior performance, enhanced efficiency, and robust generalization capabilities.
multi-satellite scheduling / time-sensitive targets / double deep Q-network (DDQN) / reinforcement learning (RL)
| [1] |
Bao X, Zhang S, Zhang X 2020. An effective method for satellite mission scheduling based on reinforcement learning. In: Proceedings of 2020 Chinese Automation Congress (CAC), Shanghai, China, 4037–4042 |
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
Li X, Chen J, Cai X, Cui N 2022. Satellite online scheduling algorithm based on proximal policy. In: Artificial Intelligence in China. Singapore: Springer, 100–108 |
| [16] |
|
| [17] |
|
| [18] |
Liu Z, Xiong W 2021. A DQN-based hyperheuristic algorithm for emergency scheduling of earth observation satellites. In: Proceedings of 2021 2nd International Conference on Electronics, Communications and Information Technology, Sanya, China, 39–47 |
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
Sunehag P, Lever G, Gruslys A, Czarnecki W, Zambaldi V, Jaderberg M, Lanctot M, Sonnerat N, Leibo Joel Z, Tuyls K, Graepel T (2018). Value-decomposition networks for cooperative multi-agent learning based on team reward. In: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), Stockholm, Sweden, pp 2085–2087 |
| [25] |
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez Aidan N, Kaiser L, Polosukhin I (2017). Attention is all you need. Preprint at arXiv. arXiv:1706.03762 |
| [26] |
|
| [27] |
|
| [28] |
Wang S (2012). Research on technology of initial alignment and inertial device calibration off the lunar surface. Dissertation for Master’s Degree. Harbin: Harbin Institute of Technology (in Chinese) |
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
Wen J (2022). Research on autonomous collaborative scheduling method of low-middle orbit satellite constellation for space moving target tracking. Dissertation for the Master’s Degree. Changsha: National University of Defense Technology (in Chinese) |
| [33] |
|
| [34] |
Yang W 2021. Research on multi-satellite collaborative planning and autonomous scheduling for moving targets tracking. Dissertation for the Doctoral Degree. Changsha: National University of Defense Technology (in Chinese) |
| [35] |
Zhang S (2021). Research on multi-target onboard cooperative observation technology for agile satellites. Dissertation for the Doctoral Degree. Shanghai: Innovation Academy for Microsatellites, Chinese Academy of Sciences (in Chinese) |
| [36] |
|
| [37] |
|
| [38] |
|
Higher Education Press
/
| 〈 |
|
〉 |