Why Learning Has Aha-Moments and Why We Should Also Reward Effort, Not Just Results

Traditionally, in machine learning, the quality of the result improves steadily with time (usually slowly but still steadily). However, as we start applying reinforcement learning techniques to solve complex tasks-such as teaching a computer to play a complex game like Go-we often encounter a situation in which for a long time, there is no improvement, and then suddenly, the system's efficiency jumps almost to its maximum. A similar phenomenon occurs in human learning, where it is known as the aha-moment. In this paper, we provide a possible explanation for this phenomenon, and show that this explanation leads to the need to reward students for effort, not only for their results.

關鍵字

Machine learning ； Reinforcement learning ； Aha-moment ； Human learning ； Rewards to students

國際替代計量

全文下載

主題瀏覽

Why Learning Has Aha-Moments and Why We Should Also Reward Effort, Not Just Results

摘要

關鍵字

延伸閱讀

國際替代計量

本網站使用Cookies