搜索结果: 1-1 共查到“信息与通信工程 memory mechanism”相关记录1条 . 查询时间(0.437 秒)
Exploration is used in Q_learning because the agent will be caught in locally optimal policies due to blind exploitation.However excessive exploration will degrade the performance of Q_learning and it...