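Computer science
Segmentation
Artificial intelligence
Encoder
Iris recognition
Pattern recognition (psychology)
Convolutional neural network
Transformer
Computer vision
Biometrics
Physics
Quantum mechanics
Voltage
Operating system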
Authors
Ye Sun,Yinan Lu,Yuanning Liu,Xiaodong Zhu
Identifier
DOI:10.1109/ijcb54206.2022.10007944
Abstract
Iris images captured in less-constrained environments often suffer from adverse noise, which challenges many existing segmentation algorithms. In this paper, we propose an efficient Hybrid Transformer U-Net (HTU-Net) to address this problem. Unlike previous studies that focus only on using popular CNN technology to predict iris masks accurately, HTU-Net uses a multi-task network to obtain segmentation masks and parameterized pupillary and limbic boundaries simultaneously, which allows CNN-based iris segmentation to be applied in any regular iris recognition system. We explore the application of the Transformer to iris segmentation and propose a hybrid encoder that employs convolutional layers to extract local intensity features and the Transformer to capture long-range associative information. For decoding, we adopt a novel Multi-Head Dilated Attention that exploits multi-scale contextual information through a gating mechanism, thus emphasizing the important features and yielding powerful representations. Inspired by the consistent class characteristics of the iris, we further devise a Pyramid Center-Aware Module that captures the global structural context of the iris from a categorical perspective to improve performance. Experimental results show that our method, with fewer parameters than previous approaches, achieves competitive or new state-of-the-art performance in both iris segmentation and localization on three challenging iris datasets. Code will be released at https://github.com/Syloveslife/HTU-Net.
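To illustrate what "parameterized pupillary and limbic boundaries" can mean in practice, the following is a minimal sketch, not the authors' method. It assumes each boundary is a circle given by a center and a radius; the abstract does not state which parameterization HTU-Net uses. The names `Circle` and `iris_mask` are hypothetical and exist only for this example.

```python
from dataclasses import dataclass


@dataclass
class Circle:
    """Hypothetical boundary parameterization: center (cx, cy) and radius r."""
    cx: float
    cy: float
    r: float

    def contains(self, x: float, y: float) -> bool:
        # A point is inside if its squared distance to the center is <= r^2.
        return (x - self.cx) ** 2 + (y - self.cy) ** 2 <= self.r ** 2


def iris_mask(height: int, width: int, pupil: Circle, limbus: Circle):
    """Rasterize the annulus between the pupillary and limbic circles.

    Returns a height x width list of lists with 1 for iris pixels and 0
    elsewhere. Occlusions (eyelids, eyelashes, reflections) are ignored,
    which is an assumption of this sketch.
    """
    return [
        [1 if limbus.contains(x, y) and not pupil.contains(x, y) else 0
         for x in range(width)]
        for y in range(height)
    ]


# Toy example: a 16x16 image with concentric boundaries (assumed values).
pupil = Circle(cx=8, cy=8, r=2)
limbus = Circle(cx=8, cy=8, r=6)
mask = iris_mask(16, 16, pupil, limbus)
```

A recognition pipeline that expects boundary parameters can consume the two circles directly, while the rasterized mask corresponds to the pixel-level segmentation output. In real images the predicted mask would additionally exclude occluded regions.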