UAV maneuver decision-making via deep reinforcement learning for short-range air combat

Zhiqiang Zheng, Haibin Duan

Intelligence & Robotics, 2023, Vol. 3, Issue 1: 76-94.

DOI: 10.20517/ir.2023.04
Research Article

Abstract

Unmanned aerial vehicles (UAVs) have been applied in unmanned air combat because of their flexibility and practicality. Short-range air combat situations change rapidly, so a UAV must make autonomous maneuver decisions as quickly as possible. In this paper, a short-range air combat maneuver decision method based on deep reinforcement learning is proposed. Firstly, the combat environment, including the UAV motion model and the position and velocity relationships, is described. On this basis, the combat process is established. Secondly, several improvements based on proximal policy optimization (PPO) are proposed to enhance the maneuver decision-making ability. A gated recurrent unit (GRU) helps PPO make decisions from data spanning consecutive timesteps. The actor network's input is the UAV's observation, whereas the critic network's input, named the state, includes the blood (health) values, which cannot be observed directly. In addition, an action space with 15 basic actions and a well-designed reward function are proposed to connect the air combat environment with PPO. In particular, the reward function is divided into dense reward, event reward, and end-game reward to ensure training feasibility. The training process is composed of three phases to shorten the training time. Finally, the designed maneuver decision method is verified through an ablation study and confrontation tests. The results show that a UAV using the proposed maneuver decision method can obtain an effective action policy and make more flexible decisions in air combat.
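As a rough illustration of the three-part reward described in the abstract, the sketch below sums a dense term, an event term, and an end-game term for a single timestep. It is not the authors' reward function: the `StepInfo` fields, the `total_reward` helper, the scaling constants, and the choice to treat every non-win ending as a loss are all assumptions made only to show how such a decomposition might be combined.

```python
from dataclasses import dataclass


@dataclass
class StepInfo:
    """Hypothetical per-step summary; none of these fields come from the paper."""
    advantage: float   # assumed situational-advantage score in [-1, 1]
    hit_enemy: bool    # assumed event: own UAV damaged the opponent this step
    was_hit: bool      # assumed event: own UAV took damage this step
    done: bool         # True when the episode ended on this step
    won: bool          # only meaningful when done is True


def total_reward(info: StepInfo,
                 dense_scale: float = 0.1,
                 event_bonus: float = 1.0,
                 end_bonus: float = 10.0) -> float:
    """Sum dense, event, and end-game terms (all constants are assumed values)."""
    # Dense reward: small shaping signal available at every timestep.
    dense = dense_scale * info.advantage
    # Event reward: sparse signal tied to discrete combat events.
    event = event_bonus * (int(info.hit_enemy) - int(info.was_hit))
    # End-game reward: large terminal signal; any non-win is treated as a loss here.
    end_game = 0.0
    if info.done:
        end_game = end_bonus if info.won else -end_bonus
    return dense + event + end_game
```

For example, under these assumed constants `total_reward(StepInfo(0.5, True, False, False, False))` combines a dense term of 0.05 with an event term of 1.0 and no end-game term. Keeping the dense term small relative to the other two is one common way to let shaping guide early learning without dominating the terminal outcome.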

Keywords

Short-range air combat / unmanned aerial vehicle / deep reinforcement learning / maneuver decision / proximal policy optimization / flight simulation

Cite this article

Zhiqiang Zheng, Haibin Duan. UAV maneuver decision-making via deep reinforcement learning for short-range air combat. Intelligence & Robotics, 2023, 3(1): 76-94. DOI: 10.20517/ir.2023.04
