基于状态回溯代价分析的启发式Q学习
方敏,李浩
Heuristically Accelerated State Backtracking Q-Learning Based on Cost Analysis
FANG Min,LI Hao
模式识别与人工智能 . 2013, (9): 838 -844 .