亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios

强化学习 机器人 避碰 稳健性(进化) 计算机科学 人工智能 一般化 分布式计算 碰撞 计算机安全 数学分析 生物化学 化学 数学 基因
作者
Tingxiang Fan,Pinxin Long,Wenxi Liu,Jia Pan
出处
期刊:The International Journal of Robotics Research [SAGE Publishing]
卷期号:39 (7): 856-892 被引量:313
标识
DOI:10.1177/0278364920916531
摘要

Developing a safe and efficient collision-avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generates its paths with limited observation of other robots’ states and intentions. Prior distributed multi-robot collision-avoidance systems often require frequent inter-robot communication or agent-level features to plan a local collision-free action, which is not robust and computationally prohibitive. In addition, the performance of these methods is not comparable with their centralized counterparts in practice. In this article, we present a decentralized sensor-level collision-avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent’s steering commands in terms of the movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to learn an optimal policy. The policy is trained over a large number of robots in rich, complex environments simultaneously using a policy-gradient-based reinforcement-learning algorithm. The learning algorithm is also integrated into a hybrid control framework to further improve the policy’s robustness and effectiveness. We validate the learned sensor-level collision-3avoidance policy in a variety of simulated and real-world scenarios with thorough performance evaluations for large-scale multi-robot systems. The generalization of the learned policy is verified in a set of unseen scenarios including the navigation of a group of heterogeneous robots and a large-scale scenario with 100 robots. Although the policy is trained using simulation data only, we have successfully deployed it on physical robots with shapes and dynamics characteristics that are different from the simulated agents, in order to demonstrate the controller’s robustness against the simulation-to-real modeling error. Finally, we show that the collision-avoidance policy learned from multi-robot navigation tasks provides an excellent solution for safe and effective autonomous navigation for a single robot working in a dense real human crowd. Our learned policy enables a robot to make effective progress in a crowd without getting stuck. More importantly, the policy has been successfully deployed on different types of physical robot platforms without tedious parameter tuning. Videos are available at https://sites.google.com/view/hybridmrca .
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI2S应助科研通管家采纳,获得10
31秒前
Orange应助科研通管家采纳,获得10
31秒前
37秒前
壮观的谷冬完成签到 ,获得积分0
56秒前
po完成签到,获得积分20
1分钟前
量子星尘发布了新的文献求助10
2分钟前
Ocean完成签到,获得积分10
2分钟前
yaoqi完成签到,获得积分10
2分钟前
刘唯完成签到 ,获得积分10
3分钟前
3分钟前
Arw发布了新的文献求助10
3分钟前
FeelingUnreal完成签到,获得积分10
4分钟前
GHOSTagw完成签到,获得积分10
4分钟前
烟花应助搞科研的肥宅吴采纳,获得30
4分钟前
荀煜祺完成签到,获得积分10
4分钟前
4分钟前
4分钟前
CipherSage应助cqhecq采纳,获得10
4分钟前
4分钟前
艾米发布了新的文献求助10
4分钟前
NexusExplorer应助艾米采纳,获得10
5分钟前
5分钟前
cqhecq发布了新的文献求助10
5分钟前
希望天下0贩的0应助cqhecq采纳,获得30
5分钟前
shonichev发布了新的文献求助10
6分钟前
6分钟前
Anthocyanidin完成签到,获得积分10
7分钟前
friend516完成签到 ,获得积分10
7分钟前
勾勾完成签到 ,获得积分10
8分钟前
赘婿应助Narcissus153采纳,获得10
9分钟前
qss753发布了新的文献求助10
9分钟前
急诊守夜人完成签到 ,获得积分10
9分钟前
开放冰香完成签到 ,获得积分10
9分钟前
迪子完成签到 ,获得积分10
9分钟前
10分钟前
10分钟前
爱听歌小兔子完成签到,获得积分10
10分钟前
BowieHuang应助科研通管家采纳,获得10
10分钟前
BowieHuang应助科研通管家采纳,获得10
10分钟前
矜天完成签到 ,获得积分10
11分钟前
高分求助中
Entre Praga y Madrid: los contactos checoslovaco-españoles (1948-1977) 1000
Polymorphism and polytypism in crystals 1000
Signals, Systems, and Signal Processing 610
Discrete-Time Signals and Systems 610
Horngren's Cost Accounting A Managerial Emphasis 17th edition 600
Russian Politics Today: Stability and Fragility (2nd Edition) 500
Death Without End: Korea and the Thanatographics of War 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6086864
求助须知:如何正确求助?哪些是违规求助? 7916482
关于积分的说明 16377089
捐赠科研通 5220032
什么是DOI,文献DOI怎么找? 2790822
邀请新用户注册赠送积分活动 1773998
关于科研通互助平台的介绍 1649615