发布文献求助

Multi-Skill Policy Transfer by Option-based Deep Reinforcement Learning for Autonomous Driving

强化学习计算机科学学习迁移传输（计算）人工智能并行计算

作者

Bo Wei,Jianxin Zhao,Yinuo Zhao,Feng Tian

标识

DOI：10.1109/bigcom61073.2023.00045

摘要

Autonomous Driving presents a promising solution to the issue of road accidents, which are mostly caused by human errors. The use of artificial intelligence technologies in this field has resulted in significant advancements in tasks such as object detection, path planning, and obstacle avoidance, leading to safer and more efficient transportation. Reinforcement learning (RL) is a powerful machine learning algorithm that has demonstrated effectiveness in various autonomous driving applications. However, the vanilla single RL policy is inadequate when faced with more complex transportation scenarios involving heavy and dynamic traffic. In this paper, we propose a novel OPtion-based multi-skill policy Transfer method with deep RL for autonomous driving, called "Opt-RL", to learn a more complex target policy by integrating basic skills from multiple source policies. An adaptive option learning module is designed to efficiently use learned skills in higher-level target domains, determining when and where to distil policies from different sources. We conduct experiments on challenging tasks in the Mujoco Maze2D benchmark and a simulated highway environment. Experimental results demonstrate that Opt-RL can achieve knowledge transfer among different levels of policies and successfully train a complex high-level decision-making policy by reasonably integrating multiple basic skills; it also achieves a longer safe driving distance 16% higher than the baseline DQN.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2024年影响因子查询已上线 (2024-6-20)

更新

大幅提高文件上传限制，最高150M (2024-4-1)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 煤球完成签到，获得积分10

1秒前; 快乐零零屋发布了新的文献求助10

2秒前; Olivia发布了新的文献求助30

3秒前; 无牙完成签到，获得积分10

3秒前; 蜗牛先生完成签到，获得积分10

3秒前; 完美世界的应助被Perrylin718采纳，获得10

4秒前; chaoshen完成签到，获得积分10

6秒前; 传奇3上传了应助文件

9秒前; 调皮傲易完成签到，获得积分10

9秒前; 恰同学少年发布了新的文献求助10

10秒前; 果果完成签到，获得积分10

11秒前; 科研通AI2.0上传了应助文件

12秒前; 荣荣发布了新的文献求助10

13秒前; 科研通AI2S的应助被chouchou采纳，获得10

13秒前; 彩虹猫完成签到，获得积分10

13秒前; 李爱国的应助被阿科采纳，获得10

15秒前; 快乐零零屋完成签到，获得积分10

16秒前; 宜醉宜游宜睡的应助被爱吃棒棒糖的猫采纳，获得10

18秒前; cristin完成签到，获得积分10

18秒前; Aiden完成签到，获得积分10

18秒前; 果果发布了新的文献求助10

18秒前; 宾师傅完成签到，获得积分10

19秒前; 独特的忆彤完成签到，获得积分10

22秒前; 李爱国上传了应助文件

22秒前; 阿科完成签到，获得积分20

23秒前; 斯文败类上传了应助文件

25秒前; 脑洞疼上传了应助文件

25秒前; orixero上传了应助文件

25秒前; Coward完成签到，获得积分10

26秒前; 顺利平文发布了新的文献求助10

29秒前; 玄同发布了新的文献求助20

30秒前; Capacition6发布了新的文献求助10

30秒前; foolishbear完成签到，获得积分10

31秒前; 我们都爱绿皮书上传了应助文件

32秒前; 凹凸曼打小怪兽发布了新的文献求助10

32秒前; 皓月星辰关闭了皓月星辰的文献求助

33秒前; ys6完成签到，获得积分10

34秒前; 爱吃棒棒糖的猫完成签到，获得积分20

34秒前; 顺利平文完成签到，获得积分10

36秒前; 方语蕊完成签到，获得积分10

38秒前

高分求助中: Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000; Evolution 1500; How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000; Gerard de Lairesse : an artist between stage and studio 670; CLSI EP47 Evaluation of Reagent Carryover Effects on Test Results, 1st Edition 550; Decision Theory 500; Multiscale Thermo-Hydro-Mechanics of Frozen Soil: Numerical Frameworks and Constitutive Models 500

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 2988017; 求助须知：如何正确求助？哪些是违规求助？ 2649071; 关于积分的说明 7157302; 捐赠科研通 2283096; 什么是DOI，文献DOI怎么找？ 1210513; 版权声明 592454; 科研通“疑难数据库（出版商）”最低求助积分说明 591139

今日热心研友

干净的铅笔

宜醉宜游宜睡

坚强的广山

账户已注销

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2024 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：826996720【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通