Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents

Jian ZHAO, Youpeng ZHAO, Weixun WANG, Mingyu YANG, Xunhan HU, Wengang ZHOU, Jianye HAO, Houqiang LI

Front. Inform. Technol. Electron. Eng, 2022, Vol. 23, Issue 7: 1032-1042.

DOI: 10.1631/FITEE.2100594
Original Article

Abstract

Multi-agent reinforcement learning is difficult to apply in practice, partially because of the gap between simulated and real-world scenarios. One reason for the gap is that simulated systems always assume that agents can work normally all the time, while in practice, one or more agents may unexpectedly “crash” during the coordination process due to inevitable hardware or software failures. Such crashes destroy the cooperation among agents and lead to performance degradation. In this work, we present a formal conceptualization of a cooperative multi-agent reinforcement learning system with unexpected crashes. To enhance the robustness of the system to crashes, we propose a coach-assisted multi-agent reinforcement learning framework that introduces a virtual coach agent to adjust the crash rate during training. We have designed three coaching strategies (fixed crash rate, curriculum learning, and adaptive crash rate) and a re-sampling strategy for our coach agent. To our knowledge, this work is the first to study unexpected crashes in a multi-agent system. Extensive experiments on grid-world and StarCraft II micromanagement tasks demonstrate the efficacy of the adaptive strategy compared with the fixed crash rate strategy and curriculum learning strategy. The ablation study further illustrates the effectiveness of our re-sampling strategy.

Keywords

Multi-agent system / Reinforcement learning / Unexpected crashed agents

Cite this article

Jian ZHAO, Youpeng ZHAO, Weixun WANG, Mingyu YANG, Xunhan HU, Wengang ZHOU, Jianye HAO, Houqiang LI. Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents. Front. Inform. Technol. Electron. Eng, 2022, 23(7): 1032-1042. DOI: 10.1631/FITEE.2100594

RIGHTS & PERMISSIONS

Zhejiang University Press

Supplementary files

FITEE-1032-22006-JZ_suppl_1

FITEE-1032-22006-JZ_suppl_2
