An Autonomous Planning Method for Deep Space Exploration Tasks in Reinforcement Learning Based on Dynamic Rewards
MAO Weiyang, WANG Bin, LIU Jingxing, XIONG Xin
Keywords: deep space exploration; task planning; policy gradient; reinforcement learning; dynamic reward