Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks

Xiaoyu LIU; Chi XU; Haibin YU; Peng ZENG

doi:10.1631/FITEE2100331

PDF(5563 KB)

Front. Inform. Technol. Electron. Eng ›› 2022, Vol. 23 ›› Issue (1) : 47-60. DOI: 10.1631/FITEE2100331

Orginal Article

Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks

Xiaoyu LIU¹^,²^,³^,⁴ ,
Chi XU¹^,²^,³ ,
Haibin YU¹^,²^,³ ,
Peng ZENG¹^,²^,³

Author information +

History +

Abstract

Edge artificial intelligence will empower the ever simple industrial wireless networks (IWNs) supporting complex and dynamic tasks by collaboratively exploiting the computation and communication resources of both machine-type devices (MTDs) and edge servers. In this paper, we propose a multi-agent deep reinforcement learning based resource allocation (MADRL-RA) algorithm for end–edge orchestrated IWNs to support computation-intensive and delay-sensitive applications. First, we present the system model of IWNs, wherein each MTD is regarded as a self-learning agent. Then, we apply the Markov decision process to formulate a minimum system overhead problem with joint optimization of delay and energy consumption. Next, we employ MADRL to defeat the explosive state space and learn an effective resource allocation policy with respect to computing decision, computation capacity, and transmission power. To break the time correlation of training data while accelerating the learning process of MADRL-RA, we design a weighted experience replay to store and sample experiences categorically. Furthermore, we propose a step-by-step ε-greedy method to balance exploitation and exploration. Finally, we verify the effectiveness of MADRL-RA by comparing it with some benchmark algorithms in many experiments, showing that MADRL-RA converges quickly and learns an effective resource allocation policy achieving the minimum system overhead.

Keywords

Multi-agent deep reinforcement learning / End–edge orchestrated / Industrial wireless networks / Delay / Energy consumption

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Xiaoyu LIU, Chi XU, Haibin YU, Peng ZENG. Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrial wireless networks. Front. Inform. Technol. Electron. Eng, 2022, 23(1): 47‒60 https://doi.org/10.1631/FITEE2100331