稀疏奖励场景下基于状态空间探索的多智能体强化学习算法
方宝富
1,
2
, 余婷婷
1,
2
, 王浩
1,
2
, 王在俊
3
Multi-agent Reinforcement Learning Algorithm Based on State Space Exploration in Sparse Reward Scenarios
FANG Baofu
1,
2
, YU Tingting
1,
2
, WANG Hao
1,
2
, WANG Zaijun
3
状态子集空间