小学生
人工智能
计算机科学
计算机视觉
稳健性(进化)
变压器
像素
编码器
机器视觉
模式识别(心理学)
工程类
电压
光学
生物化学
化学
物理
电气工程
基因
操作系统
作者
Li Wang,Changyuan Wang,Yu Zhang
标识
DOI:10.1142/s0218001422550163
摘要
Pupil detection is an indispensable part of the process of eye-tracking. Due of the limitation of existing methods on pupil image quality, we propose a pupil detection method using vision transformer with a hybrid structure. We first extract the local features of the image with CNN, and then obtain the global dependence through the encoder of the transformer, to excavate more accurate information on pupil position. We trained and tested the proposed model on 10 600 images from three publicly available datasets and compared with other pupil detection models. The analysis of the outcomes demonstrated that the hybrid vision transformer was superior to these comparison approaches in terms of accuracy and robustness in locating the pupil position. It achieved a detection rate of more than 90% for pupils within a 5-pixel error in all evaluated datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI