PDF(516 KB)
一种结合TileCoding的平均奖赏强化学习算法*
王巍巍,陈兴国,高阳
PDF(516 KB)
一种结合TileCoding的平均奖赏强化学习算法*
An Average Reward Reinforcement Learning Algorithm with Tile Coding
| {{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
| 〈 |
|
〉 |