论文部分内容阅读
In multi-agent systems, joint-action must be employed to achieve cooperation because the evaluation of the behavior of an agent often depends on the other agents' behaviors. However, joint-action reinforcement learning algorithms suffer the slow conve