Offline Reinforcement Learning Algorithm Based on Selection of High-Quality Samples
HOU Yonghong, DING Wang, REN Yi, DONG Hongwei, YANG Songling
Pattern Recognition and Artificial Intelligence, 2024, (11): 1022-1032.
DOI: 10.16451/j.cnki.issn1003-6059.202411007