Multi-agent Confrontation System Based on Reinforcement Learning

Qianying Li; Baolong Guo; Zhe Huang and Cheng Li

doi:10.23977/ESAC2020026

Multi-agent Confrontation System Based on Reinforcement Learning

Download as PDF

DOI: 10.23977/ESAC2020026

Author(s)

Qianying Li, Baolong Guo, Zhe Huang and Cheng Li

Corresponding Author

Qianying Li

ABSTRACT

The development of machine learning technology has so far no longer relied entirely on artificial power in adversarial decision-making, but there has been an updated decision competition. The climax of the development of artificial intelligence and machine computing capabilities has provided powerful technical support and hardware support for the development of intelligent adversarial systems. The improved actor-critic extended multi-agent strategy gradient method (ACEM) is mainly used to optimize the behavior decision of multi-agent cooperation and hostility. This algorithm introduces extended strategy review information on the basic actor-critic framework for reward calculation. The use of adversarial strategy inference reduces the impact of environmental instability on strategy selection. This method has significantly improved the degree of completion of target tasks in cooperative and hostile environments.

KEYWORDS

Multi-agent; Adversarial System; Deep Reinforcement Learning; Actor-Critic Framework

Multi-agent Confrontation System Based on Reinforcement Learning

Author(s)

Corresponding Author

ABSTRACT

KEYWORDS

RESOURCES

JOIN US

PUBLICATION SERVICES

CONTACT US