DKTNet: Dual-Key Transformer Network for small object detection

计算机科学 变压器 计算 人工智能 卷积(计算机科学) 模式识别(心理学) 目标检测 算法 人工神经网络 电压 量子力学 物理
作者
Shoukun Xu,Jianan Gu,Yining Hua,Yi Liu
出处
期刊:Neurocomputing [Elsevier]
卷期号:525: 29-41 被引量:51
标识
DOI:10.1016/j.neucom.2023.01.055
摘要

Object detection is a fundamental computer vision task that plays a crucial role in a wide range of real-world applications. However, it is still a challenging task to detect the small size objects in the complex scene, due to the low resolution and noisy representation appearance caused by occlusion, distant depth view, etc. To tackle this issue, a novel transformer architecture, Dual-Key Transformer Network (DKTNet), is proposed in this paper. To improve the feature attention ability, the coherence of linear layer outputs Q and V are enhanced by a dual-K integrated from K1 and K2, which are computed along Q and V, respectively. Instead of spatial-wise attention, channel-wise self-attention mechanism is adopted to promote the important feature channels and suppress the confusing ones. Moreover, 2D and 1D convolution computations for Q, K and V are proposed. Compared with the fully-connected computation in conventional transformer architectures, the 2D convolution can better capture local details and global contextual information, and the 1D convolution can reduce network complexity significantly. Experimental evaluation is conducted on both general and small object detection datasets. The superiority of the aforementioned features in our proposed approach is demonstrated with the comparison against the state-of-the-art approaches.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
hao完成签到,获得积分10
刚刚
1秒前
研友_VZG7GZ应助火星上不尤采纳,获得10
1秒前
Genetrix应助笑点低的酒窝采纳,获得10
1秒前
cling完成签到,获得积分20
1秒前
1秒前
2秒前
2秒前
立志躺平发布了新的文献求助10
2秒前
syh5527029完成签到,获得积分10
2秒前
LX发布了新的文献求助10
2秒前
寒江雪应助daidai采纳,获得150
2秒前
yyyyy完成签到,获得积分10
2秒前
3秒前
木瓜完成签到,获得积分10
3秒前
3秒前
4秒前
4秒前
4秒前
现代的bb完成签到,获得积分10
5秒前
5秒前
5秒前
5秒前
期待完成签到,获得积分10
5秒前
欣欣完成签到,获得积分10
5秒前
6秒前
Promise完成签到,获得积分10
6秒前
6秒前
小二郎应助光亮的元龙采纳,获得10
6秒前
五五发布了新的文献求助10
6秒前
6秒前
6秒前
李键刚完成签到,获得积分10
6秒前
药企牛马发布了新的文献求助30
6秒前
完美世界应助chc采纳,获得10
7秒前
8秒前
完美世界应助xxl采纳,获得10
8秒前
8秒前
欣欣发布了新的文献求助10
8秒前
8秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Kinesiophobia : a new view of chronic pain behavior 3000
Les Mantodea de guyane 2500
Molecular Biology of Cancer: Mechanisms, Targets, and Therapeutics 2000
Standard: In-Space Storable Fluid Transfer for Prepared Spacecraft (AIAA S-157-2024) 1000
What is the Future of Psychotherapy in a Digital Age? 700
Signals, Systems, and Signal Processing 510
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5953452
求助须知:如何正确求助?哪些是违规求助? 7158122
关于积分的说明 15930998
捐赠科研通 5088173
什么是DOI,文献DOI怎么找? 2734742
邀请新用户注册赠送积分活动 1695610
关于科研通互助平台的介绍 1616930