Multi-agent deep reinforcement learning for hyperspectral band selection with hybrid teacher guide

高光谱成像 强化学习 选择(遗传算法) 人工智能 钢筋 计算机科学 机器学习 心理学 社会心理学
作者
Jie Feng,Qiyang Gao,Ronghua Shang,Xianghai Cao,Gaiqin Bai,Xiangrong Zhang,Licheng Jiao
出处
期刊:Knowledge Based Systems [Elsevier BV]
卷期号:299: 112044-112044 被引量:9
标识
DOI:10.1016/j.knosys.2024.112044
摘要

Due to the presence of noisy and highly redundant bands in hyperspectral images (HSIs), band selection serves as a key preprocessing for downstream classification tasks. Recently, deep reinforcement learning (DRL) has been developed as a new trend for band selection of HSIs. Existing DRL-based methods often adopt single-agent, which are prone to fall into local optima due to an excessive action space. The multi-agent methods provide a feasible solution, but often require too much computation. To address these problems, a novel multi-agent DRL method with hybrid teacher guide (MH-DRL) is proposed for band selection of HSIs. In MH-DRL, each agent corresponding to a spectral band decides whether this band is selected. Moreover, a presentation-evaluation network (PE-Net) is constructed to design the reward by evaluating the candidate band subsets without any fine-tuning and represent the state by extracting the spatial-spectral features of HSIs. Then, three kinds of experienced band selection models are regarded as the teachers and designed to participate in the band exploration of DRL, which can improve the learning effectiveness and efficiency by accumulating the external knowledge from diverse teacher models. Finally, deep Q-learning algorithm is designed to update the agents and improve their self-learning ability from continuous exploration. Experimental results on three widely-used HSI data verify the performance of the proposed method better than some advanced band selection algorithms of HSIs.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
1秒前
qurent完成签到,获得积分20
3秒前
SUNstp发布了新的文献求助10
3秒前
可靠寒云发布了新的文献求助10
4秒前
柠檬01210完成签到,获得积分20
5秒前
孙颖完成签到 ,获得积分10
5秒前
7秒前
星辰发布了新的文献求助10
7秒前
9秒前
赫利完成签到,获得积分10
11秒前
SUNstp完成签到,获得积分10
11秒前
再睡十分钟完成签到,获得积分10
11秒前
11秒前
luojinjin完成签到,获得积分10
13秒前
000发布了新的文献求助10
13秒前
13秒前
传奇3应助马克采纳,获得10
14秒前
josiko发布了新的文献求助10
15秒前
Hello应助火星上的海亦采纳,获得10
15秒前
Orange应助柠檬01210采纳,获得10
15秒前
16秒前
无花果应助zpctx采纳,获得10
16秒前
16秒前
miao发布了新的文献求助10
18秒前
Ava应助SODAPIE采纳,获得10
19秒前
秦桂敏完成签到 ,获得积分10
20秒前
bbzhang发布了新的文献求助10
20秒前
hdjdb完成签到 ,获得积分10
21秒前
21秒前
完美世界应助zsg11067采纳,获得10
23秒前
24秒前
24秒前
爽朗的小王同学完成签到,获得积分10
24秒前
26秒前
Budowen发布了新的文献求助10
27秒前
细胞在江山在完成签到 ,获得积分10
28秒前
星辰完成签到,获得积分10
28秒前
Ava应助zpctx采纳,获得10
28秒前
钟鱼发布了新的文献求助10
29秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Materials selection in mechanical design 500
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6481779
求助须知:如何正确求助?哪些是违规求助? 8282108
关于积分的说明 17664936
捐赠科研通 5565904
什么是DOI,文献DOI怎么找? 2911942
邀请新用户注册赠送积分活动 1889071
关于科研通互助平台的介绍 1744140