块(置换群论)
人工智能
计算机科学
特征(语言学)
特征提取
计算机视觉
杂乱
模式识别(心理学)
目标检测
传输(电信)
支持向量机
雷达
数学
电信
哲学
语言学
几何学
作者
Hyeyeon Choi,Jong Pil Yun,Bum Jun Kim,Hyeonah Jang,Sang Woo Kim
出处
期刊:IEEE Transactions on Industrial Informatics
[Institute of Electrical and Electronics Engineers]
日期:2022-02-01
卷期号:18 (11): 7686-7695
被引量:31
标识
DOI:10.1109/tii.2022.3147833
摘要
Transmission line (TL) inspection is important for ensuring a stable supply of electricity to rural areas. Currently, there are several TL detection approaches based on computer vision; however, they have limitations owing to background clutter in visible light images. This article presents a novel multimodal image feature fusion module that utilizes both visible light and infrared images to enhance the TL-detection performance. The proposed module consists of a multibranch feature extraction (MFE) block followed by a channelwise attention (CA) block. The first block extracts the representative features of each modal input using multiple branches. The outputs of the MFE block are jointly aggregated into an attention vector in the CA block. Finally, the attention vector recalibrates each input feature of the proposed module. To reduce the number of additional parameters due to the insertion of the module, we introduced a channel-shrink factor in the MFE block and utilized a $1\times {1}$ convolution in the CA block. Comparison experiments with various augmented conditions of day, night, fog, and snow were conducted on a real-world dataset, which we constructed by visible light and infrared images. The results showed that the proposed module outperformed not only the case of single modal input but also the state-of-the-art fusion methods, regardless of the baseline networks. Additionally, the proposed module showed effectiveness in terms of capacity when the baseline network has a large number of weight parameters.
科研通智能强力驱动
Strongly Powered by AbleSci AI