计算机科学
稳健性(进化)
感受野
字符识别
人工智能
领域(数学)
性格(数学)
模式识别(心理学)
语音识别
机制(生物学)
计算机视觉
图像(数学)
数学
生物化学
基因
认识论
哲学
化学
纯数学
几何学
作者
Haibo Qin,Chun Yang,Xiaobin Zhu,Xu-Cheng Yin
标识
DOI:10.1007/978-3-030-86331-9_15
摘要
Existing attention-based recognition methods generally assume that the character scale and spacing in the same text instance are basically consistent. However, this hypothesis not always hold in the context of complex scene images. In this study, we propose an innovative dynamic receptive field adaption (DRA) mechanism for recognizing scene text robustly. Our DRA introduces different levels of receptive field features for classifying character and designs a novel way to explore historical attention information when calculating attention map. In this way, our method can adaptively adjust receptive field according to the variations of character scale and spacing in a scene text. Hence, our DRA mechanism can generate more informative features for recognition than traditional attention-based mechanisms. Notablely, our DRA mechanism can be easily generalized to off-the-shelf attention-based methods in text recognition to improve their performances. Extensive experiments on various public available benchmarks, including the IIIT-5K, SVT, SVTP, CUTE80, and ICDAR datasets, indicate the effectiveness and robustness of our method against the state-of-the art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI