Improving deep reinforcement learning by safety guarding model via hazardous experience planning
Pai PENG , Fei ZHU , Xinghong LING , Peiyao ZHAO , Quan LIU
Front. Comput. Sci. ›› 2022, Vol. 16 ›› Issue (4) : 164320
Improving deep reinforcement learning by safety guarding model via hazardous experience planning
| [1] |
|
| [2] |
Cheng R, Orosz G, Murray R M, Burdick J W. End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2019, 3387−3395 |
| [3] |
Saunders W, Sastry G, Stuhlmueller A, Evans O. Trial without error: towards safe reinforcement learning via human intervention. In: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems. 2018, 2067−2069 |
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
Higher Education Press
/
| 〈 |
|
〉 |