本篇論文結合皮亞傑同化調適的概念,以及強效式學習和倒傳遞類神經網路技術,發展代理人適應動態多變環境的知識與能力。智慧型代理人面對不確定的環境資訊,根據舊有的知識與能力,對自己的知識架構進行同化或調適,使其能更適應環境的變化。我們以Robocup足球模擬平台進行足球競賽,以驗證本研究所提出的方法。
Adapting learning is the essential ability to improve the convergence rate and learning quality in the multi-agent system. A cooperative mechanism needs learning cooperatively between agents. This research applies assimilation and accommodation in complex environment to product effective action. There are intentional schema and perceptional schema in our assimilation and accommodation. In intentional schema, reinforcement learning is used to choose target state. In perceptional schema, back-propagation neural network is used to predict environmental forward dynamics. When the error between predicting state and actual state is too large, it means our knowledge can’t assimilate this sample. So we must adjust our knowledge to fit it. This is an accommodation process. We use the RoboCup simulator to explain our research.