亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Design and Experimental Validation of Deep Reinforcement Learning-Based Fast Trajectory Planning and Control for Mobile Robot in Unknown Environment

航路点 强化学习 计算机科学 弹道 人工智能 深度学习 人工神经网络 运动规划 移动机器人 任务(项目管理) 机器人 实时计算 机器学习 模拟 工程类 物理 系统工程 天文
作者
Runqi Chai,Hanlin Niu,Joaquín Carrasco,Farshad Arvin,Hujun Yin,Barry Lennox
出处
期刊:IEEE transactions on neural networks and learning systems [Institute of Electrical and Electronics Engineers]
卷期号:35 (4): 5778-5792 被引量:277
标识
DOI:10.1109/tnnls.2022.3209154
摘要

This article is concerned with the problem of planning optimal maneuver trajectories and guiding the mobile robot toward target positions in uncertain environments for exploration purposes. A hierarchical deep learning-based control framework is proposed which consists of an upper level motion planning layer and a lower level waypoint tracking layer. In the motion planning phase, a recurrent deep neural network (RDNN)-based algorithm is adopted to predict the optimal maneuver profiles for the mobile robot. This approach is built upon a recently proposed idea of using deep neural networks (DNNs) to approximate the optimal motion trajectories, which has been validated that a fast approximation performance can be achieved. To further enhance the network prediction performance, a recurrent network model capable of fully exploiting the inherent relationship between preoptimized system state and control pairs is advocated. In the lower level, a deep reinforcement learning (DRL)-based collision-free control algorithm is established to achieve the waypoint tracking task in an uncertain environment (e.g., the existence of unexpected obstacles). Since this approach allows the control policy to directly learn from human demonstration data, the time required by the training process can be significantly reduced. Moreover, a noisy prioritized experience replay (PER) algorithm is proposed to improve the exploring rate of control policy. The effectiveness of applying the proposed deep learning-based control is validated by executing a number of simulation and experimental case studies. The simulation result shows that the proposed DRL method outperforms the vanilla PER algorithm in terms of training speed. Experimental videos are also uploaded, and the corresponding results confirm that the proposed strategy is able to fulfill the autonomous exploration mission with improved motion planning performance, enhanced collision avoidance ability, and less training time.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
小梨子完成签到,获得积分10
3秒前
王禹恒发布了新的文献求助10
4秒前
molihuakai应助乔沃维奇采纳,获得10
4秒前
4秒前
FashionBoy应助王禹恒采纳,获得10
8秒前
10秒前
Dr_Fang完成签到 ,获得积分10
14秒前
舒鑫发布了新的文献求助10
19秒前
Rosen完成签到 ,获得积分10
19秒前
fly赖赖赖完成签到,获得积分10
21秒前
dean完成签到,获得积分10
24秒前
Orange应助曾经的人雄采纳,获得10
27秒前
瞌睡虫子完成签到 ,获得积分10
28秒前
藏沙发布了新的文献求助20
29秒前
29秒前
舒服的豪英完成签到,获得积分10
34秒前
camile发布了新的文献求助10
34秒前
35秒前
xinyi完成签到 ,获得积分10
39秒前
腾空星完成签到 ,获得积分10
41秒前
43秒前
45秒前
王禹恒发布了新的文献求助10
48秒前
精明金毛应助科研通管家采纳,获得10
50秒前
精明金毛应助科研通管家采纳,获得10
50秒前
丘比特应助科研通管家采纳,获得10
50秒前
50秒前
乐乐应助科研通管家采纳,获得10
50秒前
Laputa发布了新的文献求助10
51秒前
丘比特应助王禹恒采纳,获得10
52秒前
南浅完成签到 ,获得积分10
52秒前
hix258完成签到,获得积分10
53秒前
582843216完成签到,获得积分10
53秒前
领导范儿应助cgc采纳,获得10
55秒前
58秒前
悦耳冰香完成签到,获得积分10
1分钟前
1分钟前
wab完成签到,获得积分0
1分钟前
FashionBoy应助bb采纳,获得10
1分钟前
热带蚂蚁完成签到 ,获得积分10
1分钟前
高分求助中
Ideology and Meaning-Making under the Putin Regime 750
Prompt Engineering for Clinicians: Harnessing AI in Everyday Medical Practice 600
Handbook of Luminescence Dating 500
Safety Pharmacology 500
《KNN基无铅压电陶瓷电学性能优化与物理机理研究》 500
Introduction to Industrial/Organizational Psychology 400
Advances in Design and Control Robust Adaptive Control: Deadzone-Adapted Disturbance Suppression 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6926852
求助须知:如何正确求助?哪些是违规求助? 8615514
关于积分的说明 18276608
捐赠科研通 6347214
什么是DOI,文献DOI怎么找? 3072166
关于科研通互助平台的介绍 2105335
邀请新用户注册赠送积分活动 2049310