计算机科学
马尔可夫决策过程
强化学习
过程(计算)
人工智能
模仿
对抗制
领域(数学)
生成语法
机器学习
动作(物理)
人机交互
马尔可夫过程
统计
纯数学
物理
操作系统
社会心理学
量子力学
数学
心理学
作者
Melik Bugra Ozcelik,Berk Agin,Ozan Çaldıran,Omer Sirin
标识
DOI:10.1109/asyu58738.2023.10296611
摘要
Autonomous driving has gained significant attention as a rapidly advancing research field in recent years. This paper addresses the challenge of efficient highway driving by formulating it as a Markov Decision Process (MDP) and leveraging Reinforcement Learning (RL) techniques. To tackle the decision-making problem, Generative Adversarial Imitation Learning (GAIL) is employed to imitate expert behavior. In this study, we acquired expert data by running an agent within the simulation, enabling us to gather a larger volume of data. Then, the expert agent is trained by Deep Q-Network (DQN) using reward shaping. Our contribution in this study involves the application of Curriculum Learning (CL) specifically to highway scenarios, gradually increasing the complexity of the traffic environment to enhance the training process. In order to achieve a behavior comparable to that of a human driver, we proposed the use of the GAIL approach specifically for highway scenarios, aiming to increase the diversity of state and action pairs. Our experiments successfully demonstrated the effectiveness of this approach, as the autonomous driving agent effectively imitated expert behavior and achieved outstanding collision-free performance on highways.
科研通智能强力驱动
Strongly Powered by AbleSci AI