基于内省推理的多agent在线学习方法

Abstract
Figure/Table
References
Related Citation (9)

Download: PDF (422 KB) HTML (1 KB)
Export: BibTeX | EndNote (RIS)

Abstract In multiagent environment, the optimal policy of an agent depends on the policies of the others, which makes the learning more problematic. Previous algorithms based on the observed behavior of opponents can not fully present individual rationality. An efficient online learning algorithm based on the internal inference is proposed, which integrates the observed objective behavior and the subjective inferential intention of the opponents. By the internal inference, agents can obtain more information about opponents, and thus learn more efficiently. Simulations results prove that the proposed algorithm performs well in classical coordination game.

Key words： Multiagent System OnlineLearning Internal Inference Electronic Market

Received: 16 May 2005

ZTFLH:

TP181.1

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	HAN Wei
	CHEN YouGuang
	JIANG ChangHua

Cite this article:

HAN Wei,CHEN YouGuang,JIANG ChangHua. An InternalInference Based Multiagent Learning Method[J]. , 2007, 20(2): 254-260.

URL:

http://manu46.magtech.com.cn/Jweb_prai/EN/ OR http://manu46.magtech.com.cn/Jweb_prai/EN/Y2007/V20/I2/254

[1] Littman M L. Markov Games as a Framework for MultiAgent Reinforcement Learning // Cohen W W, Hirsh H, eds. Proc of the 11th International Conference on Machine Learning. New Brunswick, USA, 1994: 157163
[2] Hu Junling, Wellman M P. Multiagent Reinforcement Learning: Theoretical Framework and Algorithm // Proc of the 15th International Conference on Machine Learning. Madison, USA, 1998: 242250
[3] Bowling M, Veloso M. Rational and Convergent Learning in Stochastic Games // Proc of the 17th International Joint Conference of Artificial Intelligence. Seattle, USA, 2001: 10211026
[4] Bowling M, Veloso M. Multiagent Learning Using a Variable Learning Rate. Artificial Intelligence, 2002, 136(2): 215250
[5] Shapley L S. Stochastic Games. Proc of the National Academy of Sciences,1953, 39: 10951100
[6] Weiss G, Sen S. Adaptation and Learning in Multiagent Systems // Weiss G, Sen S, eds. Lecture Notes in Artificial Intelligence. Berlin, Germany: SpringerVerlag, 1996, 1042: 221229
[7] Stone P, Veloso M. Multiagent Systems: A Survey from a Machine Learning Perspective. Autonomous Robotics, 2002, 8(3): 345383
[8] Sutton R S, Barto A G. Reinforcement Learning. Cambridge, USA: MIT Press, 1998
[9] Claus C, Boutilier C. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems // Proc of the 15th National Conference on Artificial Intelligence. Cambridge, USA: MIT Press, 1997: 235262
[10] Fudenberg D, Levine D K. The Theory of Learning in Games. Cambridge, USA: MIT Press, 1998
[11] Brafman R I, Tennenholtz M. Learning to Coordinate Efficiently: A Model Based Approach. Journal of Artificial Intelligence Research, 2003, 19(1): 1123
[12] Copper R W. Coordination Games: Complementarities and Macroeconomics. Cambridge, UK: Cambridge University Press, 1998
[13] Mataric M J. Interaction and Intelligent Behavior. Ph.D Dissertation. Cambridge, USA: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, 1994: 2223