UAV swarm communication networking and routing optimization for high-demand users: a graph attention multi-agent reinforcement learning approach
Zhaopeng Ning , Gang Li , Wei Li
Autonomous Intelligent Systems ›› 2026, Vol. 6 ›› Issue (1) : 9
Unmanned aerial vehicle swarms serving ground high-demand communication users in dynamic environments must simultaneously optimize three-dimensional trajectories, communication network topology, and routing strategies while considering limited energy, link quality fluctuations, and collision avoidance constraints. This problem faces three core challenges: routing decisions under dynamic topology require real-time adaptation to vehicle position changes and channel variations; end-to-end delay and throughput optimization in multi-hop communication demands coordinated forwarding strategies across all vehicles; high-dimensional continuous action spaces and partial observability make traditional optimization methods difficult to solve. This paper models the problem as a multi-agent partially observable Markov decision process and proposes a graph attention-based multi-agent deep deterministic policy gradient algorithm to jointly optimize velocity vectors, communication power, and routing decisions for each vehicle. The reward function comprehensively considers user quality of service, system throughput, end-to-end delay, and energy consumption while ensuring safety distance and energy margins through constraint penalties. Simulation results demonstrate that compared to single-agent deep deterministic policy gradient and independent Q-learning baseline methods, the proposed method achieves approximately 50% improvement in convergence speed, 12% to 18% increase in user service satisfaction, 25% to 40% improvement in system throughput, 30% to 45% reduction in end-to-end delay, and 39% to 102% improvement in energy efficiency. The framework dynamically adjusts network topology and routing strategies according to user demands, providing a deployable solution for large-scale vehicle swarm communication networks.
Unmanned aerial vehicle swarms / Communication networking / Routing optimization / Multi-agent deep reinforcement learning / Graph attention network / Trajectory planning
| [1] |
M. Ahmed, A.A. Nasir, M. Masood, K.A. Memon, K.K. Qureshi, F. Khan, W.U. Khan, F. Xu, Z. Han, Advancements in uav-based integrated sensing and communication: a comprehensive survey (2025). arXiv preprint. arXiv:2501.06526 |
| [2] |
|
| [3] |
K. Meng, C. Masouros, A.P. Petropulu, L. Hanzo, Cooperative Isac networks: Opportunities and challenges (2025). arXiv preprint. arXiv:2405.06305 |
| [4] |
K. Han, K. Meng, X.-Y. Wang, C. Masouros, Network-level Isac design: State-of-the-art, challenges, and opportunities (2025). arXiv preprint. arXiv:2505.01295 |
| [5] |
Z. Zhai, W. Ni, X. Wang, D. Niyato, E. Hossain, Integrated sensing and communication with uav swarms via decentralized consensus admm (2025). arXiv preprint. arXiv:2511.03283 |
| [6] |
|
| [7] |
J. Beuster, C. Andrich, S. Giehl, M. Miranda, L. Mohr, D. Novotny, T. Kaufmann, Enhancing situational awareness in Isac networks via drone swarms: a real-world channel sounding data set (2025). arXiv preprint. arXiv:2507.12010 |
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
K.K. Nguyen, T.Q. Duong, T. Do-Duy, H. Claussen, L. Hanzo, 3d UAV trajectory and data collection optimisation via deep reinforcement learning (2021). arXiv preprint. arXiv:2106.03129 |
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
C.-W. Fu, M.-L. Ku, Energy-efficient federated learning for uav communications (2025). arXiv preprint. arXiv:2508.03171 |
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
X. Yang, M. Liwang, L. Fu, Y. Su, S. Hosseinalipour, Adaptive uav-assisted hierarchical federated learning: optimizing energy, latency, and resilience for dynamic smart iot (2025). arXiv preprint. arXiv:2503.06145 |
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
The Author(s)
/
| 〈 |
|
〉 |