表(数据库)
计算机科学
邻接表
人工智能
回归
标记语言
编码(集合论)
数据挖掘
树(集合论)
机器学习
算法
程序设计语言
统计
数学
数学分析
集合(抽象数据类型)
XML
操作系统
作者
Hangdi Xing,Feiyu Gao,Rujiao Long,Jiajun Bu,Qi Zheng,Liangcheng Li,Cong Ye,Zhi Yu
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence
[Association for the Advancement of Artificial Intelligence (AAAI)]
日期:2023-06-26
卷期号:37 (3): 2992-3000
被引量:1
标识
DOI:10.1609/aaai.v37i3.25402
摘要
Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Recent methods solve this problem by predicting the adjacency relations of detected cell boxes, or learning to generate the corresponding markup sequences from the table images. However, they either count on additional heuristic rules to recover the table structures, or require a huge amount of training data and time-consuming sequential decoders. In this paper, we propose an alternative paradigm. We model TSR as a logical location regression problem and propose a new TSR framework called LORE, standing for LOgical location REgression network, which for the first time combines logical location regression together with spatial location regression of table cells. Our proposed LORE is conceptually simpler, easier to train and more accurate than previous TSR models of other paradigms. Experiments on standard benchmarks demonstrate that LORE consistently outperforms prior arts. Code is available at https:// github.com/AlibabaResearch/AdvancedLiterateMachinery/tree/main/DocumentUnderstanding/LORE-TSR.
科研通智能强力驱动
Strongly Powered by AbleSci AI