Dynamic value iteration networks for the planning of rapidly changingUAVswarms

Wei LI; Bowei YANG; Guanghua SONG; Xiaohong JIANG

doi:10.1631/FITEE.1900712

PDF(3677 KB)

Front. Inform. Technol. Electron. Eng ›› 2021, Vol. 22 ›› Issue (5) : 687-696. DOI: 10.1631/FITEE.1900712

Orginal Article

Dynamic value iteration networks for the planning of rapidly changingUAVswarms

Author information +

History +

Abstract

In an unmanned aerial vehicle ad-hoc network (UANET), sparse and rapidly mobile unmanned aerial vehicles (UAVs)/nodes can dynamically change the UANET topology. This may lead to UANET service performance issues. In this study, for planning rapidly changing UAV swarms, we propose a dynamic value iteration network (DVIN) model trained using the episodic Q-learning method with the connection information of UANETs to generate a state value spread function, which enables UAVs/nodes to adapt to novel physical locations. We then evaluate the performance of the DVIN model and compare it with the non-dominated sorting genetic algorithm II and the exhaustive method. Simulation results demonstrate that the proposed model significantly reduces the decisionmaking time for UAV/node path planning with a high average success rate.

Keywords

Dynamic value iteration networks / Episodic Q-learning / Unmanned aerial vehicle (UAV) ad-hoc network / Non-dominated sorting genetic algorithm II (NSGA-II) / Path planning

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Wei LI, Bowei YANG, Guanghua SONG, Xiaohong JIANG. Dynamic value iteration networks for the planning of rapidly changingUAVswarms. Front. Inform. Technol. Electron. Eng, 2021, 22(5): 687‒696 https://doi.org/10.1631/FITEE.1900712