模式识别与人工智能
Tuesday, Apr. 22, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
  2013, Vol. 26 Issue (9): 838-844    DOI:
Orignal Article Current Issue| Next Issue| Archive| Adv Search |
Heuristically Accelerated State Backtracking Q-Learning Based on Cost Analysis
FANG Min,LI Hao
School of Computer Science and Technology,Xidian University,Xi′an 710071

Download: PDF (495 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  Since action strategy learning is time-consuming for the reinforcement learning algorithm,a heuristic reinforcement learning algorithm is presented based on state backtracking. By analyzing the repetitive states and comparing the action policies of the reinforcement learning,a cost function is defined to indicate the importance of repetitive actions. A probability-based heuristic function is presented by combining an action reward with an action cost. The proposed algorithm reinforces the importance of an action to speed up learning by the heuristic function and measures the feasibility of an action to reduce unnecessary exploration by the cost function at the same time,thus the learning efficiency is steadily improve. This cost-based action strategy is proved to be reasonable. Two simulation scenarios are built and the experimental results of robot games prove that the proposed algorithm can learn by the tradeoff between rewards and costs,and effectively improve the convergence of Q-learning.
Key wordsCost Analysis      Heuristic Function      State Backtracking      Q-Learning     
Received: 13 August 2012     
ZTFLH: TP181  
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
FANG Min
LI Hao
Cite this article:   
FANG Min,LI Hao. Heuristically Accelerated State Backtracking Q-Learning Based on Cost Analysis[J]. , 2013, 26(9): 838-844.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2013/V26/I9/838
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn