Multi-agent Reinforcement Learning Algorithm Based on State Space Exploration in Sparse Reward Scenarios
FANG Baofu1,2, YU Tingting1,2, WANG Hao1,2, WANG Zaijun3
Comparison of the average win rates of each algorithm in the 3s5z_vs_3s6z scenario