发布文献求助

Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter

计算机科学文本识别推论文本检测人工智能光学字符识别特征（语言学）语音识别自然语言处理补语（音乐）模式识别（心理学）图像（数学）哲学语言学生物化学化学互补基因表型

作者

Tianwei Wang,Yuanzhi Zhu,Lianwen Jin,Dezhi Peng,Zhe Li,Mengchao He,Yongpan Wang,Canjie Luo

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.1109/cvpr46437.2021.00591

摘要

Text recognition is a popular research subject with many associated challenges. Despite the considerable progress made in recent years, the text recognition task itself is still constrained to solve the problem of reading cropped line text images and serves as a subtask of optical character recognition (OCR) systems. As a result, the final text recognition result is limited by the performance of the text detector. In this paper, we propose a simple, elegant and effective paradigm called Implicit Feature Alignment (IFA), which can be easily integrated into current text recognizers, resulting in a novel inference mechanism called IFA- inference. This enables an ordinary text recognizer to process multi-line text such that text detection can be completely freed. Specifically, we integrate IFA into the two most prevailing text recognition streams (attention-based and CTC-based) and propose attention-guided dense prediction (ADP) and Extended CTC (ExCTC). Furthermore, the Wasserstein-based Hollow Aggregation Cross-Entropy (WH-ACE) is proposed to suppress negative predictions to assist in training ADP and ExCTC. We experimentally demonstrate that IFA achieves state-of-the-art performance on end-to-end document recognition tasks while maintaining the fastest speed, and ADP and ExCTC complement each other on the perspective of different application scenarios. Code will be available at https://github.com/Wang-Tianwei/Implicit-feature-alignment.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 量子星尘发布了新的文献求助10

刚刚; 咎青文完成签到，获得积分10

1秒前; 没有昵称完成签到，获得积分10

1秒前; 不不鱼完成签到，获得积分10

1秒前; 呆萌的鼠标完成签到，获得积分0

1秒前; SYLH的应助被zhencheng采纳，获得10

1秒前; 追寻的丹烟完成签到，获得积分10

2秒前; reading gene发布了新的文献求助10

2秒前; Lily完成签到，获得积分10

3秒前; Hello的应助被WAN采纳，获得10

4秒前; 至幸完成签到，获得积分10

4秒前; swby完成签到，获得积分10

5秒前; Nalisher完成签到，获得积分10

5秒前; 华仔上传了应助文件

5秒前; 会飞的蜗牛发布了新的文献求助10

5秒前; 熠熠完成签到，获得积分10

6秒前; 研友_Y59785上传了应助文件

7秒前; hhhhhh上传了应助文件

7秒前; 轩辕访波发布了新的文献求助10

8秒前; 桐桐上传了应助文件

8秒前; MrH完成签到，获得积分10

9秒前; 陶远望完成签到，获得积分10

9秒前; 柒柒完成签到，获得积分10

9秒前; lasfjas完成签到，获得积分10

9秒前; HHHHH完成签到，获得积分10

9秒前; G蛋白偶联关闭了G蛋白偶联的文献求助

9秒前; FFF发布了新的文献求助10

10秒前; 熊猫盖浇饭完成签到，获得积分10

10秒前; 柠檬完成签到，获得积分10

10秒前; 慕慕倾完成签到，获得积分10

11秒前; Mine发布了新的文献求助10

11秒前; 共享精神的应助被甜美追命采纳，获得10

12秒前; 润润轩轩完成签到，获得积分10

12秒前; ROMANTIC完成签到，获得积分10

12秒前; 大气小小完成签到，获得积分20

12秒前; ywindm完成签到，获得积分10

13秒前; 大模型上传了应助文件

13秒前; jichups完成签到，获得积分10

13秒前; 我是老大的应助被YQ采纳，获得10

13秒前; 凉雨渲完成签到，获得积分10

14秒前

高分求助中: 【提示信息，请勿应助】关于scihub 10000; The Mother of All Tableaux: Order, Equivalence, and Geometry in the Large-scale Structure of Optimality Theory 3000; Social Research Methods (4th Edition) by Maggie Walter (2019) 2390; A new approach to the extrapolation of accelerated life test data 1000; 北师大毕业论文基于可调谐半导体激光吸收光谱技术泄漏气体检测系统的研究 390; Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 370; Robot-supported joining of reinforcement textiles with one-sided sewing heads 360

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 4009093; 求助须知：如何正确求助？哪些是违规求助？ 3548906; 关于积分的说明 11300209; 捐赠科研通 3283436; 什么是DOI，文献DOI怎么找？ 1810365; 邀请新用户注册赠送积分活动 886129; 科研通“疑难数据库（出版商）”最低求助积分说明 811259

今日热心研友

比比谁的速度快

元气少女猪刚鬣

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通