透過您的圖書館登入
IP:3.135.204.216
  • 會議論文
  • OpenAccess

Parameter Selection in Reinforcement Learning

摘要


強迫式學習是機器學習的一個分支,其應用場合之一為無明確命令之自主機器人控制。機器人在行動過程中,僅能獲得「好」「壞」程度的回饋訊息,並不像監督式學習有一個明確的答案。看似極具威力的學習方式,卻受到參數極大的影響。本文將以系統化的方法探討參數的選擇對強迫式機器學習法的影響。我們將以Udacity磨課師(MOOCs)課程網站的案例,講述過程與結果。

並列摘要


In order to clarify how parameters could influence the performance of reinforcement learning, this paper provides a systematical approach to study the relation between the selection of parameters and the performance. We will provide a brief review of reinforcement learning. Then, we adopt an example from a well-known MOOC platform, Udacity, to demonstrate how reinforcement learning works and the relation between parameters and performance. Simulation results will also be provided.

延伸閱讀