足球代理人的適應與合作學習

足球機器人的比賽，是許多多代理人系統研究者所拿來作為其理論的效益指標，然而，一個足球機器人的比賽，是需要三個具有合作能力的機器人，面對不同的足球場環境，能夠使用有效的合作策略，達到得分的目的。由於比賽中的足球場上的環境狀態不停改變，監督式學習或是使用單一學習的方法，很難讓足球機器人在比賽前都學習到將會遇到的不同環境，本篇將使用皮亞傑的同化與調適結合Q-earning 的方法，來使我們的足球機器人能夠在比賽中，能夠自我學習、自我組織足球場上的知識，並累積得分的策略，達到得分的目的。

關鍵字

RoboCup ；同化；調適； Q-Learning

並列摘要

There are many multi-agent researchers using RoboCup game to evaluate the efficient of their multi-agent methods. A robot has to face variety environments in a RoboCup game. Using supervisor learning or other single learning method is useless for a robot agent to adapt an environment or cooperate with other robots. In this paper, we integrate Piaget’s assimilation and accommodation processes with Q-Learning algorithm based on BDI model in our robot agent. We show our robot agent can be self-learning and self-organization and cooperate with other robot agents in a RoboCup game.

並列關鍵字

RoboCup ； Assimilation ； Accommodation ； Q-Learning

參考文獻

[1] A. Rao, and M. Georgeff, “BDI Agents: From Theory to Practice”, International

Conference on Multi-Agent Systems, San Francisco, USA, 1995.

Intelligence, 2000.

[6] T. Taniguchi, T. Sawaragi, “Assimilation and Accommodation for

Self-organizational Learning of autonomous Robots: Proposal of Dual-Schemata

國際替代計量

足球代理人的適應與合作學習

全文下載

主題瀏覽