稀疏奖励场景下基于状态空间探索的多智能体强化学习算法
方宝富, 余婷婷, 王浩, 王在俊
Multi-agent Reinforcement Learning Algorithm Based on State Space Exploration in Sparse Reward Scenarios
FANG Baofu, YU Tingting, WANG Hao, WANG Zaijun
模式识别与人工智能
.
2024, (5): 435
-446
.
DOI: 10.16451/j.cnki.issn1003-6059.202405005