计算机科学
特征(语言学)
棱锥(几何)
人工智能
频道(广播)
卷积神经网络
分割
编码(内存)
模式识别(心理学)
解码方法
图层(电子)
计算机视觉
算法
电信
物理
哲学
光学
语言学
有机化学
化学
作者
Xiuli Bi,Jinwu Hu,Bin Xiao,Weisheng Li,Xinbo Gao
出处
期刊:IEEE Transactions on Big Data
[Institute of Electrical and Electronics Engineers]
日期:2023-04-01
卷期号:9 (2): 688-700
被引量:11
标识
DOI:10.1109/tbdata.2022.3187413
摘要
The instance segmentation task is relatively difficult in computer vision, which requires not only high-quality masks but also high-accuracy instance category classification. Mask R-CNN has been proven to be a feasible method. However, due to the Feature Pyramid Network (FPN) structure lack useful channel information, global information and low-level texture information, and mask branch cannot obtain useful local-global information, Mask R-CNN is prevented from obtaining high-quality masks and high-accuracy instance category classification. Therefore, we proposed the Information-enhanced Mask R-CNN, called IEMask R-CNN. In the FPN structure of IEMask R-CNN, the information-enhanced FPN will enhance the useful channel information and the global information of the feature maps to solve the issues that the high-level feature map loses useful channel information and inaccurate of instance category classification, meanwhile the bottom-up path enhancement with adaptive feature fusion will ultilize the precise positioning signal in the lower layer to enhance the feature pyramid. In the mask branch of IEMask R-CNN, an encoding-decoding mask head will strength local-global information to gain a high-quality mask. Without bells and whistles, IEMask R-CNN gains significant gains of about 2.60%, 4.00%, 3.17% over Mask R-CNN on MS COCO2017, Cityscapes and LVIS1.0 benchmarks respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI