Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning

强化学习 计算机科学 避碰 启发式 机器人 人工智能 行人 碰撞 自主代理人 增强学习 机器学习 计算机安全 操作系统 运输工程 工程类
作者
Michael Everett,Yu Fan Chen,Jonathan P. How
出处
期刊:IEEE Access [Institute of Electrical and Electronics Engineers]
卷期号:9: 10357-10377 被引量:206
标识
DOI:10.1109/access.2021.3050338
摘要

Collision avoidance algorithms are essential for safe and efficient robot\noperation among pedestrians. This work proposes using deep reinforcement (RL)\nlearning as a framework to model the complex interactions and cooperation with\nnearby, decision-making agents, such as pedestrians and other robots. Existing\nRL-based works assume homogeneity of agent properties, use specific motion\nmodels over short timescales, or lack a principled method to handle a large,\npossibly varying number of agents. Therefore, this work develops an algorithm\nthat learns collision avoidance among a variety of heterogeneous,\nnon-communicating, dynamic agents without assuming they follow any particular\nbehavior rules. It extends our previous work by introducing a strategy using\nLong Short-Term Memory (LSTM) that enables the algorithm to use observations of\nan arbitrary number of other agents, instead of a small, fixed number of\nneighbors. The proposed algorithm is shown to outperform a classical collision\navoidance algorithm, another deep RL-based algorithm, and scales with the\nnumber of agents better (fewer collisions, shorter time to goal) than our\npreviously published learning-based approach. Analysis of the LSTM provides\ninsights into how observations of nearby agents affect the hidden state and\nquantifies the performance impact of various agent ordering heuristics. The\nlearned policy generalizes to several applications beyond the training\nscenarios: formation control (arrangement into letters), demonstrations on a\nfleet of four multirotors and on a fully autonomous robotic vehicle capable of\ntraveling at human walking speed among pedestrians.\n
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
BenLuo完成签到,获得积分10
刚刚
生动白开水完成签到,获得积分10
刚刚
0mmm完成签到,获得积分10
1秒前
key完成签到,获得积分10
2秒前
2秒前
Akim应助tly采纳,获得10
3秒前
猫和老鼠发布了新的文献求助10
3秒前
星辰大海应助柔情公蚂蚁采纳,获得10
4秒前
852应助PM采纳,获得10
4秒前
wwwewqe完成签到 ,获得积分10
4秒前
Carrie发布了新的文献求助10
5秒前
量子星尘发布了新的文献求助10
5秒前
6秒前
6秒前
搜集达人应助1337采纳,获得10
6秒前
zhanglh123完成签到,获得积分10
6秒前
yyyyy应助月亮与六便士采纳,获得10
7秒前
CipherSage应助自由的明雪采纳,获得10
7秒前
8秒前
9秒前
zerolly完成签到,获得积分10
10秒前
乐空思应助sirhai采纳,获得50
11秒前
小孟吖完成签到 ,获得积分10
11秒前
12秒前
13秒前
13秒前
瘦瘦鼠标发布了新的文献求助10
13秒前
fffff生发布了新的文献求助20
13秒前
Ainra发布了新的文献求助10
13秒前
tly完成签到,获得积分10
14秒前
15秒前
candidate发布了新的文献求助10
15秒前
16秒前
Anatee完成签到,获得积分10
17秒前
17秒前
可爱的函函应助trial采纳,获得10
19秒前
szr发布了新的文献求助10
19秒前
20秒前
20秒前
沉默的稀完成签到,获得积分10
20秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Burger's Medicinal Chemistry, Drug Discovery and Development, Volumes 1 - 8, 8 Volume Set, 8th Edition 1800
Cronologia da história de Macau 1600
Contemporary Debates in Epistemology (3rd Edition) 1000
International Arbitration Law and Practice 1000
文献PREDICTION EQUATIONS FOR SHIPS' TURNING CIRCLES或期刊Transactions of the North East Coast Institution of Engineers and Shipbuilders第95卷 1000
BRITTLE FRACTURE IN WELDED SHIPS 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6163386
求助须知:如何正确求助?哪些是违规求助? 7991276
关于积分的说明 16615377
捐赠科研通 5270833
什么是DOI,文献DOI怎么找? 2812166
邀请新用户注册赠送积分活动 1792227
关于科研通互助平台的介绍 1658469