端到端原则
面部识别系统
计算机科学
比例(比率)
面子(社会学概念)
人工智能
语音识别
心理学
模式识别(心理学)
地理
地图学
语言学
哲学
作者
M. Saad Shakeel,Yuxuan Zhang,Xin Wang,Wenxiong Kang,Arif Mahmood
标识
DOI:10.1016/j.jvcir.2022.103628
摘要
• A multi-scale channel attention network with an adaptive fusion strategy (MSCAN-AFF) is proposed for face recognition. • Our MSCAN-AFF guides the backbone network to pay attention to highly informative facial regions for feature extraction. • An attention-guided spatial transformer network (MSCAN-STN) is proposed that aligns attention maps in an end-to-end manner. • Extensive experiments on various benchmark datasets clearly validate the effectiveness of our proposed method. Attention modules embedded in deep networks mediate the selection of informative regions for object recognition. In addition, the combination of features learned from different branches of a network can enhance the discriminative power of these features. However, fusing features with inconsistent scales is a less-studied problem. In this paper, we first propose a multi-scale channel attention network with an adaptive feature fusion strategy (MSCAN-AFF) for face recognition (FR), which fuses the relevant feature channels and improves the network’s representational power. In FR, face alignment is performed independently prior to recognition, which requires the efficient localization of facial landmarks, which might be unavailable in uncontrolled scenarios such as low-resolution and occlusion. Therefore, we propose utilizing our MSCAN-AFF to guide the Spatial Transformer Network (MSCAN-STN) to align feature maps learned from an unaligned training set in an end-to-end manner. Experiments on benchmark datasets demonstrate the effectiveness of our proposed MSCAN-AFF and MSCAN-STN.
科研通智能强力驱动
Strongly Powered by AbleSci AI