A Sarsa reinforcement learning hybrid ensemble method for robotic battery power forecasting
Fei Peng , Hui Liu , Li Zheng
Journal of Central South University ›› 2023, Vol. 30 ›› Issue (11) : 3867 -3880.
A Sarsa reinforcement learning hybrid ensemble method for robotic battery power forecasting
Building a rail transit workshop with efficient data interconnection has become an inevitable trend in the transformation and development of the current rail transit equipment industry. More and more diversified mobile transport robots have become a priority in the process of digital transformation of smart factories. Accurate prediction of robot battery power can guide the control center to adopt scientific and reasonable instructions in advance to ensure efficient and stable operation of the logistics transportation chain. In this study, we propose a hybrid ensemble method of multiple learners based on state-action-reward-state-action (Sarsa) reinforcement learning algorithm. Maximal overlap discrete wavelet transform (MODWT) is used to preprocess the originally measured robot power supply voltage data. This significantly reduces the non-stationarity and volatility of time series data. Gated recurrent unit (GRU), deep belief network (DBN), and long short-term memory (LSTM), are utilized for the prediction modeling of subseries after decomposition. Finally, the Sarsa reinforcement learning ensemble strategy is used to weight the three basic predictors above. The performance of the Sarsa hybrid model is verified on three real mobile robot power data sets. Experimental results elaborate that the transportation robot battery power hybrid forecasting model is competitive in robustness, accuracy, and adaptability.
robotic power management / transportation robot / time series forecasting / deep learning / Sarsa reinforcement learning / ensemble model
| [1] |
JONES J L, SEIGER B A, FLYNN A M. Mobile robots: Inspiration to implementation [M]. AK Peters/CRC Press, 1998. |
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
PENTZER J, BRENNAN S, REICHARD K. On-line estimation of vehicle motion and power model parameters for skid-steer robot energy use prediction [C]// 2014 American Control Conference. IEEE, 2014: 2786–2791. |
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
SIAMI-NAMINI S, NAMIN A. Forecasting economics and financial time series: ARIMA vs LSTM [J]. arXiv preprint arXiv:180306386, 2018. |
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
/
| 〈 |
|
〉 |