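Reinforcement learning
Artificial neural network
Computer science
Markov decision process
Theory (learning stability)
Function (biology)
Distributed computing
Service (business)
Process (computing)
Control (management)
Artificial intelligence
Operations research
Markov process
Engineering
Machine learning
Operating system
Economics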
Statistics
Biology
Evolutionary biology
Mathematics
Authors
Cheng-shuo Ying, Andy H.F. Chow, Hoa T.M. Nguyen, Kwai-Sang Chin
Identifier
DOI:10.1016/j.trb.2022.05.001
Abstract
This paper presents an adaptive control system for coordinated metro operations with flexible train composition by using a multi-agent deep reinforcement learning (MADRL) approach. The control problem is formulated as a Markov decision process (MDP) with multiple agents regulating different service lines in a metro network with passenger transfer. To ensure the overall computational effectiveness and stability of the control system, we adopt an actor–critic reinforcement learning framework in which each control agent is associated with a critic function for estimating future system states and an actor function deriving local operational decisions. The critics and actors in the MADRL are represented by multi-layer artificial neural networks (ANNs). A multi-agent deep deterministic policy gradient (MADDPG) algorithm is developed for training the actor and critic ANNs through successive simulated transitions over the entire metro network. The developed framework is tested with a real-world scenario in Bakerloo and Victoria Lines of London Underground, UK. Experiment results demonstrate that the proposed method can outperform previous centralized optimization and distributed control approaches in terms of solution quality and performance achieved. Further analysis shows the merits of MADRL for coordinated service regulation with flexible train composition. This study contributes to real-time coordinated metro network services with flexible train composition and advanced optimization techniques.
• An adaptive rail transit control system with passengers’ transfers and flexible train composition.
• A novel modeling and optimization framework based on multi-agent deep reinforcement learning.
• A computational framework with ‘decentralized execution and centralized training’ for effectiveness and stability.
• Case study demonstrating the system efficiency and computational effectiveness of proposed algorithm over previous methods.
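The 'decentralized execution and centralized training' structure named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class and function names (`Agent`, `build_agents`), the dimensions, and the observation values are all hypothetical, and single linear layers stand in for the multi-layer ANNs. Training itself (gradient updates, replay buffer, target networks) is omitted; the sketch only shows which inputs each actor and critic sees.

```python
import math
import random


class Agent:
    """One control agent (e.g. one service line): a local actor plus a centralized critic."""

    def __init__(self, obs_dim, joint_dim, rng):
        # Assumption: one linear layer each, standing in for the paper's multi-layer ANNs.
        self.actor_w = [rng.uniform(-0.1, 0.1) for _ in range(obs_dim)]
        self.critic_w = [rng.uniform(-0.1, 0.1) for _ in range(joint_dim)]

    def act(self, obs):
        # Decentralized execution: the actor sees only its own local observation.
        z = sum(w * o for w, o in zip(self.actor_w, obs))
        return math.tanh(z)  # deterministic action, squashed into (-1, 1)

    def q_value(self, all_obs, all_actions):
        # Centralized training: the critic sees every agent's observation and action.
        joint = [x for obs in all_obs for x in obs] + list(all_actions)
        return sum(w * x for w, x in zip(self.critic_w, joint))


def build_agents(n_agents, obs_dim, seed=0):
    rng = random.Random(seed)
    # Critic input: all observations concatenated, plus one scalar action per agent.
    joint_dim = n_agents * obs_dim + n_agents
    return [Agent(obs_dim, joint_dim, rng) for _ in range(n_agents)]


# Two agents with made-up three-dimensional local observations.
agents = build_agents(n_agents=2, obs_dim=3)
observations = [[0.2, -0.1, 0.5], [0.0, 0.3, -0.4]]
actions = [agent.act(obs) for agent, obs in zip(agents, observations)]
q_values = [agent.q_value(observations, actions) for agent in agents]
```

At run time each agent would need only `act` on its own observation; the joint-input `q_value` is used during training alone, which is what keeps execution decentralized.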