强化学习
排队
区间(图论)
计算机科学
动作(物理)
持续时间(音乐)
空格(标点符号)
信号(编程语言)
增强学习
钢筋
状态空间
数学优化
控制(管理)
控制理论(社会学)
人工智能
数学
工程类
计算机网络
统计
艺术
物理
文学类
结构工程
组合数学
量子力学
程序设计语言
操作系统
作者
Haoqing Luo,Yiming Bie,Sheng Jin
出处
期刊:IEEE Transactions on Intelligent Transportation Systems
[Institute of Electrical and Electronics Engineers]
日期:2024-01-02
卷期号:25 (6): 5225-5241
被引量:1
标识
DOI:10.1109/tits.2023.3344585
摘要
The prevailing reinforcement-learning-based traffic signal control methods are typically staging-optimizable or duration-optimizable, depending on the action spaces. In this paper, we use hybrid proximal policy optimization to synchronously optimize the stage specification and green interval duration. Under reformulated traffic demands, the intrinsic imperfections of (implementing optimization in) discrete or continuous action spaces are revealed. By comparison, hybrid action space offers a unified search space, in which our proposed method is able to better balance the trade-off between frequent switching and unsaturated release. Experiments in both single-agent and multi-agent scenarios are given to demonstrate that the proposed method reduces queue length and delay by an average of 12.72% and 11.89%, compared to the state-of-the-art RL methods. Furthermore, by calculating the Gini coefficients of right-of-way, we reveal that the proposed method does not harm fairness while improving efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI