一种基于PSO的分层策略搜索算法<sup>*</sup>

Abstract
Figure/Table
References
Related Citation (3)

Download: PDF (403 KB) HTML (1 KB)
Export: BibTeX | EndNote (RIS)

Abstract In order to overcome drawbacks in hierarchical policy gradient reinforcement learning algorithm (HPGRL), such as problem of local optimum, a new algorithm for searching hierarchical policies is proposed, named Hierarchical Policy Search Based on PSO (PSOHPS). The designers create the task decomposition graph according to the hierarchical theory of MAXQ, one of the classical hierarchical reinforcement learning techniques. Then the hierarchical parameterized policies of all compound subtasks are evolved in process of direct interaction with the environment by utilizing a particle swarm to acquire the optimized action policies. Experimental results demonstrate the algorithm is valid and its performance outperforms that of HPGRL remarkably.

Key words： Hierarchical Reinforcement Learning Particle Swarm Optimization (PSO) Hierarchical Policies Negotiation Deadlock

Received: 07 December 2006

ZTFLH:

TP181

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	PENG ZhiPing
	LI ShaoPing

Cite this article:

PENG ZhiPing,LI ShaoPing. An Algorithm for Hierarchical Policy Search Based on PSO[J]. , 2008, 21(1): 98-103.

URL:

http://manu46.magtech.com.cn/Jweb_prai/EN/ OR http://manu46.magtech.com.cn/Jweb_prai/EN/Y2008/V21/I1/98