计算机科学
强化学习
马尔可夫决策过程
过程(计算)
熵(时间箭头)
最大熵原理
人工智能
博弈论
马尔可夫过程
机器学习
模拟
数学
数理经济学
统计
物理
量子力学
操作系统
作者
Wenli Li,Fanke Qiu,Lingxi Li,Yinan Zhang,Kan Wang
出处
期刊:IEEE transactions on intelligent vehicles
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:9 (1): 1079-1093
标识
DOI:10.1109/tiv.2023.3323138
摘要
Simulation testing based on virtual scenarios can improve the efficiency of safety testing for high-level autonomous vehicles (AVs). In most traffic scenarios, such as merging scenarios, the interactions between vehicles are a game process. Therefore, a critical factor is to accurately simulate the game and interaction processes between the background vehicle (BV) and AV in the test environment. With the increasing availability of natural driving data, a data-driven approach can be introduced to identify the underlying driving behavior patterns in actual driving data. Thus, this paper proposes a data-driven method for modeling BV behavior for AV testing in virtual scenarios. The method describes the vehicle decision process in the merging scenario as a standard Markov decision process (MDP). Based on game theory, we considered the BV as a game subject to illustrate the vehicle interaction process. Furthermore, a deep maximum entropy-inverse reinforcement learning combined with the game matrix is proposed to identify the reward function that describes BV behavior. The obtained reward function is used to design a deep Q-network algorithm to simulate the behavior of BV. Finally, the effectiveness and feasibility of the proposed method are verified by comparing it with natural driving data. Moreover, we performed comparative tests with the other two baseline methods; the results show that the proposed method can accurately simulate the interaction behaviors between vehicles in the virtual scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI