Application of hierarchical reinforcement learning in engineering domain
Wei Li , Qingtai Ye , Changming Zhu
Journal of Systems Science and Systems Engineering ›› 2005, Vol. 14 ›› Issue (2) : 207 -217.
Application of hierarchical reinforcement learning in engineering domain
The slow convergence rate of reinforcement learning algorithms limits their wider application. In engineering domains, hierarchical reinforcement learning is developed to perform actions temporally according to prior knowledge. This system can converge fast due to reduced state space. There is a test of elevator group control to show the power of the new system. Two conventional group control algorithms are adopted as prior knowledge. Performance indicates that hierarchical reinforcement learning can reduce the learning time dramatically.
Engineering domain knowledge / controller / reinforcement learning / elevator / group control
| [1] |
Bao, G., C. G. Cassandras, T. E. Djaferis, A.D. Gandhi, and D. P. Looze, “Elevator dispatchers for down peak traffic”, ECE Department Technical Report, University of Massachusetts, 1994. |
| [2] |
|
| [3] |
Bradtke, S. J. and M. O. Duff, “Reinforcement learning methods for continuous-time Markov decision problems”, Advances in Neural Information Processing Systems 7, Cambridge, MA, 1995. |
| [4] |
Crites, R. H. and A. G. Barto, “Improving elevator performance using reinforcement learning”, Advances in Neural Information Processing Systems 8, pp1017–1023, 1996. |
| [5] |
Mahadevan, S., M. Nicholas, D. Tapas. and G. Abhijit, “Self-Improving factory simulation using continuous-time average-reward reinforcement learning”, Proceedings of the 14th International Conference on Machine Learning (IMLC’ 97), Nashville, TN, 1997. |
| [6] |
|
| [7] |
|
| [8] |
Rajbala, M., M. Sridhar, and G. Mohammad, “Hierarchical multi-agent reinforcement learning”, Proceedings of the fifth International Conference on Autonomous Agents, pp246–253, 2001. |
| [9] |
|
| [10] |
|
| [11] |
Zhang, W., “Reinforcement learning for job-shop scheduling”, PhD Thesis, Oregon State University. 1996. |
/
| 〈 |
|
〉 |