Motion planning of a quadrotor robot game using a simulation-based projected policy iteration method

Li-dong ZHANG; Ban WANG; Zhi-xiang LIU; You-min ZHANG; Jian-liang AI

doi:10.1631/FITEE.1800571

PDF(1207 KB)

Front. Inform. Technol. Electron. Eng ›› 2019, Vol. 20 ›› Issue (4) : 525-537. DOI: 10.1631/FITEE.1800571

Regular Papers

Motion planning of a quadrotor robot game using a simulation-based projected policy iteration method

Author information +

History +

Abstract

Making rational decisions for sequential decision problems in complex environments has been challenging researchers in various fields for decades. Such problems consist of state transition dynamics, stochastic uncertainties, long-term utilities, and other factors that assemble high barriers including the curse of dimensionality. Recently, the state-of-the-art algorithms in reinforcement learning studies have been developed, providing a strong potential to efficiently break the barriers and make it possible to deal with complex and practical decision problems with decent performance. We propose a formulation of a velocity varying one-on-one quadrotor robot game problem in the threedimensional space and an approximate dynamic programming approach using a projected policy iteration method for learning the utilities of game states and improving motion policies. In addition, a simulation-based iterative scheme is employed to overcome the curse of dimensionality. Simulation results demonstrate that the proposed decision strategy can generate effective and efficient motion policies that can contend with the opponent quadrotor and gather advantaged status during the game. Flight experiments, which are conducted in the Networked Autonomous Vehicles (NAV) Lab at the Concordia University, have further validated the performance of the proposed decision strategy in the real-time environment.

Keywords

Reinforcement learning / Approximate dynamic programming / Decision making / Motion planning / Unmanned aerial vehicle

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Li-dong ZHANG, Ban WANG, Zhi-xiang LIU, You-min ZHANG, Jian-liang AI. Motion planning of a quadrotor robot game using a simulation-based projected policy iteration method. Front. Inform. Technol. Electron. Eng, 2019, 20(4): 525‒537 https://doi.org/10.1631/FITEE.1800571