Parameter Selection in Reinforcement Learning

強迫式學習是機器學習的一個分支，其應用場合之一為無明確命令之自主機器人控制。機器人在行動過程中，僅能獲得「好」「壞」程度的回饋訊息，並不像監督式學習有一個明確的答案。看似極具威力的學習方式，卻受到參數極大的影響。本文將以系統化的方法探討參數的選擇對強迫式機器學習法的影響。我們將以Udacity磨課師(MOOCs)課程網站的案例，講述過程與結果。

關鍵字

自動駕駛車；強迫式機器學習；參數選擇；馬可夫決策過程

並列摘要

In order to clarify how parameters could influence the performance of reinforcement learning, this paper provides a systematical approach to study the relation between the selection of parameters and the performance. We will provide a brief review of reinforcement learning. Then, we adopt an example from a well-known MOOC platform, Udacity, to demonstrate how reinforcement learning works and the relation between parameters and performance. Simulation results will also be provided.

並列關鍵字

self-driving car ； reinforcement learning ； parameter selection ； Markov decision processes

國際替代計量

Parameter Selection in Reinforcement Learning

全文下載

主題瀏覽