
Applying Assimilation and Accommodation for Cooperative Learning of Multi-Agent
(運用同化與調適於多代理人的合作學習)

Advisor: Jong-Yih Kuo (郭忠義)

Abstract


This thesis combines Piaget's concepts of assimilation and accommodation with reinforcement learning and back-propagation neural network techniques to develop an agent's knowledge and ability to adapt to dynamic, changing environments. When facing uncertain environmental information, an intelligent agent assimilates the information into its knowledge structure, or accommodates that structure, based on its existing knowledge and abilities, so that it can better adapt to environmental changes. We run soccer competitions on the RoboCup soccer simulation platform to validate the proposed method.
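The reinforcement-learning step described above can be illustrated with a standard tabular Q-learning update. This is a minimal sketch only: the state/action encoding, the learning rate `alpha`, and the discount `gamma` are illustrative assumptions, not values taken from the thesis.

```python
import numpy as np

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: move Q[s, a] toward the
    bootstrapped target r + gamma * max_a' Q[s_next, a']."""
    best_next = np.max(Q[s_next])
    Q[s, a] += alpha * (r + gamma * best_next - Q[s, a])
    return Q

# Hypothetical toy setting: 4 states, 2 actions, all values start at zero.
Q = np.zeros((4, 2))
q_update(Q, s=0, a=1, r=1.0, s_next=2)
target_action = int(np.argmax(Q[0]))  # greedy choice in state 0 after the update
```

After one rewarded transition, the greedy policy in state 0 already prefers the rewarded action; repeated updates over many episodes propagate value back through the state space.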

Keywords

RoboCup, assimilation, accommodation, Q-Learning

English Abstract


Adaptive learning is an essential ability for improving the convergence rate and learning quality in a multi-agent system, and a cooperative mechanism requires agents to learn cooperatively with one another. This research applies assimilation and accommodation in a complex environment to produce effective actions. Our assimilation and accommodation framework comprises an intentional schema and a perceptional schema. In the intentional schema, reinforcement learning is used to choose the target state. In the perceptional schema, a back-propagation neural network is used to predict the environment's forward dynamics. When the error between the predicted state and the actual state is too large, our knowledge cannot assimilate the sample, so we must adjust our knowledge to fit it; this is the accommodation process. We use the RoboCup simulator to validate our research.

English Keywords

RoboCup, assimilation, accommodation, Q-Learning, neural network

