Modality Registration and Object Search Framework for UAV-Based Unregistered RGB-T Image Salient Object Detection

计算机视觉 人工智能 计算机科学 RGB颜色模型 模态(人机交互) 透视图(图形) 目标检测 图像配准 对象(语法) 分割 图像(数学)
作者
Kechen Song,Hongwei Wen,Xiaotong Xue,Liming Huang,Yingying Ji,Yunhui Yan
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
卷期号:61: 1-15 被引量:4
标识
DOI:10.1109/tgrs.2023.3332179
摘要

UAVs are widely used in various industries, and various visual tasks under the perspective of the UAV have been widely studied. In particular, the RGB-T detection method based on UAVs has shown significant advantages. However, existing RGB-T methods are designed based on registration image pairs rather than detecting images directly acquired by UAVs. This detection process is limited by the accuracy of image registration. And image registration wastes a lot of time. To solve the above problems, we construct an unregistered RGB-T image salient object detection (SOD) dataset under the UAV perspective, known as UAV RGB-T 2400. The dataset includes many challenging scenes, and the images are not manually registered. Further, we construct a modality registration and object search (MROS) framework for unregistered RGB-T SOD. Firstly, a modality registration scheme is proposed to solve the unregistration problem of modal features. We successively perform pixel-level registration from a local perspective and semantic-level registration from a global perspective for different modal features. And we carry out the channel and spatial interaction for the different modal features in modality registration. Aiming at the interference problem in the UAV detection environment, we propose an object search scheme. The two high-level features are used to search the object location, and the three low-level features are used to refine the object and produce prediction results. Experimental results on the UAV RGB-T 2400 dataset show that MROS is effective compared with state-of-the-art methods. The code is available at: https://github.com/VDT-2048/UAV-RGB-T-2400.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
风趣乐荷应助科研通管家采纳,获得30
刚刚
刚刚
无私的芹应助科研通管家采纳,获得10
刚刚
刚刚
华仔应助科研通管家采纳,获得10
刚刚
刚刚
刚刚
刚刚
华仔应助菲雨采纳,获得10
1秒前
鸭鸭酱完成签到,获得积分10
1秒前
1秒前
田様应助深情的小笼包采纳,获得10
1秒前
弟弟可不止是条狗完成签到,获得积分20
1秒前
Akim应助我先睡了采纳,获得10
1秒前
Aj的科研助手完成签到 ,获得积分10
2秒前
devilito发布了新的文献求助10
3秒前
量子星尘发布了新的文献求助10
3秒前
4秒前
DD发布了新的文献求助10
4秒前
晴朗完成签到,获得积分20
5秒前
ggjy发布了新的文献求助10
6秒前
6秒前
之_ZH完成签到 ,获得积分10
6秒前
MZ应助小可爱采纳,获得10
7秒前
打打应助晚风采纳,获得10
7秒前
Gauss完成签到,获得积分0
8秒前
8秒前
8秒前
10秒前
杨琳发布了新的文献求助10
10秒前
10秒前
斜杠小猪完成签到,获得积分10
10秒前
wutong发布了新的文献求助10
10秒前
10秒前
yao发布了新的文献求助10
10秒前
蒋蒋完成签到 ,获得积分10
11秒前
zhj完成签到,获得积分10
11秒前
李爱国应助米饭多加水采纳,获得30
12秒前
12秒前
12秒前
高分求助中
The Mother of All Tableaux Order, Equivalence, and Geometry in the Large-scale Structure of Optimality Theory 2400
Ophthalmic Equipment Market by Devices(surgical: vitreorentinal,IOLs,OVDs,contact lens,RGP lens,backflush,diagnostic&monitoring:OCT,actorefractor,keratometer,tonometer,ophthalmoscpe,OVD), End User,Buying Criteria-Global Forecast to2029 2000
A new approach to the extrapolation of accelerated life test data 1000
Cognitive Neuroscience: The Biology of the Mind 1000
Cognitive Neuroscience: The Biology of the Mind (Sixth Edition) 1000
Optimal Transport: A Comprehensive Introduction to Modeling, Analysis, Simulation, Applications 800
Official Methods of Analysis of AOAC INTERNATIONAL 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 3958929
求助须知:如何正确求助?哪些是违规求助? 3505199
关于积分的说明 11122925
捐赠科研通 3236708
什么是DOI,文献DOI怎么找? 1788949
邀请新用户注册赠送积分活动 871444
科研通“疑难数据库(出版商)”最低求助积分说明 802794