Reinforcement learning with parameterized action space and sparse reward for UAV navigation

Shiying Feng , Xiaofeng Li , Lu Ren , Shuiqing Xu

Intelligence & Robotics ›› 2023, Vol. 3 ›› Issue (2) : 161 -75.

PDF
Intelligence & Robotics ›› 2023, Vol. 3 ›› Issue (2) :161 -75. DOI: 10.20517/ir.2023.10
Research Article
Research Article

Reinforcement learning with parameterized action space and sparse reward for UAV navigation

Author information +
History +
PDF

Abstract

Autonomous navigation of unmanned aerial vehicles (UAVs) is widely used in building rescue systems. As the complexity of the task increases, traditional methods based on environment models are hard to apply. In this paper, a reinforcement learning (RL) algorithm is proposed to solve the UAV navigation problem. The UAV navigation task is modeled as a Markov Decision Process (MDP) with parameterized actions. In addition, the sparse reward problem is also taken into account. To address these issues, we develop the HER-MPDQN by combining Multi-Pass Deep Q-Network (MP-DQN) and Hindsight Experience Replay (HER). Two UAV navigation simulation environments with progressive difficulty are constructed to evaluate our method. The results show that HER-MPDQN outperforms other baselines in relatively simple tasks. Especially for complex tasks involving relay operations, only our method can achieve satisfactory performance.

Keywords

Deep reinforcement learning / parameterized action space / sparse reward

Cite this article

Download citation ▾
Shiying Feng, Xiaofeng Li, Lu Ren, Shuiqing Xu. Reinforcement learning with parameterized action space and sparse reward for UAV navigation. Intelligence & Robotics, 2023, 3(2): 161-75 DOI:10.20517/ir.2023.10

登录浏览全文

4963

注册一个新账户 忘记密码

References

AI Summary AI Mindmap
PDF

63

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/