强化学习
钢筋
计算机科学
人工智能
心理学
社会心理学
作者
Lucian Busoniu,Robert Babuska,Bart De Schutter
出处
期刊:IEEE transactions on systems, man and cybernetics
[Institute of Electrical and Electronics Engineers]
日期:2008-03-01
卷期号:38 (2): 156-172
被引量:1569
标识
DOI:10.1109/tsmcc.2007.913919
摘要
Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many tasks arising in these domains makes them difficult to solve with preprogrammed agent behaviors. The agents must, instead, discover a solution on their own, using learning. A significant part of the research on multiagent learning concerns reinforcement learning techniques. This paper provides a comprehensive survey of multiagent reinforcement learning (MARL). A central issue in the field is the formal statement of the multiagent learning goal. Different viewpoints on this issue have led to the proposal of many different goals, among which two focal points can be distinguished: stability of the agents' learning dynamics, and adaptation to the changing behavior of the other agents. The MARL algorithms described in the literature aim---either explicitly or implicitly---at one of these two goals or at a combination of both, in a fully cooperative, fully competitive, or more general setting. A representative selection of these algorithms is discussed in detail in this paper, together with the specific issues that arise in each category. Additionally, the benefits and challenges of MARL are described along with some of the problem domains where the MARL techniques have been applied. Finally, an outlook for the field is provided.
科研通智能强力驱动
Strongly Powered by AbleSci AI