Deep Q-Network Based Target Search of UAV under Partially Observable Conditions

可见的 计算机科学 人工智能 物理 量子力学
作者
Sheng Nan Jiang,Sheng Wang,Subing Huang
标识
DOI:10.1109/icsece58870.2023.10263279
摘要

Autonomous target search for unmanned aerial vehicles (UAVs) has broad application scenarios in both military and civilian domains. In this paper, In this paper, a deep reinforcement learning-based autonomous target search method for the USV is proposed which incorporates the recurrent neural network to achieve accurate decision-making under partially observable conditions. In our study, firstly, we model the UAV's target search task as a partially observable Markov decision process problem and use the Deep Q-Network(DQN) to evaluate the current state and generate actions. Secondly, the gated recurrent unit (GRU) is introduced to mitigate the decision bias that occurs in the UAV under incompletely observable conditions and to achieve a mapping from incomplete to more complete perceptual information of the UAV at the current moment. Finally, to avoid the problem of slow convergence of the algorithm caused by sparse rewards, we combine prior knowledge to design reward functions suitable for the UAV target tracking task to facilitate faster convergence of the algorithm. To verify the effectiveness of the algorithm, we designed an experimental environment for UAV target search based on a 3D simulation platform. The experimental results show that our method enables the UAV to successfully search for targets in less time for the same training episodes.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
铱铂发布了新的文献求助10
1秒前
1秒前
karaha发布了新的文献求助10
1秒前
1秒前
1秒前
1秒前
在水一方应助康帅傅采纳,获得10
1秒前
2秒前
2秒前
2秒前
内向面包完成签到,获得积分10
2秒前
3秒前
勤劳凝梦完成签到,获得积分20
4秒前
又又发布了新的文献求助10
4秒前
冷酷保温杯完成签到,获得积分10
4秒前
此时此刻发布了新的文献求助10
4秒前
小美人余完成签到,获得积分10
5秒前
Mountain发布了新的文献求助10
5秒前
哇哈哈发布了新的文献求助10
5秒前
植物碱发布了新的文献求助10
5秒前
jiangnan发布了新的文献求助10
6秒前
6秒前
6秒前
勤劳凝梦发布了新的文献求助10
6秒前
7秒前
清风明月完成签到,获得积分10
7秒前
SciGPT应助危机的灵阳采纳,获得10
8秒前
yw完成签到,获得积分10
8秒前
8秒前
8秒前
9秒前
9秒前
microbiome121完成签到,获得积分10
10秒前
10秒前
10秒前
10秒前
10秒前
wxj发布了新的文献求助10
11秒前
科研通AI6.3应助不安乐菱采纳,获得10
12秒前
12秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Picture this! Including first nations fiction picture books in school library collections 2000
The Composition and Relative Chronology of Dynasties 16 and 17 in Egypt 1500
Cowries - A Guide to the Gastropod Family Cypraeidae 1200
ON THE THEORY OF BIRATIONAL BLOWING-UP 666
Signals, Systems, and Signal Processing 610
“美军军官队伍建设研究”系列(全册) 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6386125
求助须知:如何正确求助?哪些是违规求助? 8199768
关于积分的说明 17345640
捐赠科研通 5439809
什么是DOI,文献DOI怎么找? 2876741
邀请新用户注册赠送积分活动 1853238
关于科研通互助平台的介绍 1697314