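Decoding methods
Computer science
Tree (set theory)
Field (mathematics)
Sequence (biology)
Sequential decoding
Artificial intelligence
Expression (computer science)
Handwriting
Tree structure
Pattern recognition (psychology)
Algorithm
Mathematics
Binary tree
Mathematical analysis
Block code
Biology
Genetics
Programming language
Pure mathematics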
Authors
Zhe Li,Wentao Yang,Hengnian Qi,Lianwen Jin,Yichao Huang,Kai Ding
Identifier
DOI:10.1016/j.patcog.2023.110220
Abstract
Handwritten mathematical expression recognition (HMER) is a challenging computer vision task because mathematical expressions (MEs) have a complex two-dimensional spatial structure and are written in diverse handwriting styles. The recent mainstream approach treats MEs as objects with tree structures, modeled by sequence decoders or tree decoders. These decoders recognize the symbols and the relationships between them in depth-first order, resulting in many decoding steps, which can harm performance, particularly for MEs with complex structures. In this paper, we propose a novel tree-based model with branch parallel decoding for HMER, which parses the structure of ME trees by explicitly predicting the relationships between symbols. In addition, a query constructing module is proposed to help the decoder decode the branches of ME trees in parallel, thus reducing the number of decoding time steps and alleviating the problem of long sequence attention decoding. As a result, our model outperforms existing models on three widely used benchmarks and demonstrates significant improvements in HMER performance.
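The difference in decoding steps between depth-first and branch parallel decoding can be illustrated with a toy count. The sketch below is not the paper's model: the `Node` class, the relation labels (`above`, `below`, `right`), and the rule that every open branch advances by one symbol per step are all assumptions made for illustration. Under those assumptions, depth-first decoding needs one step per symbol, while parallel decoding needs as many steps as the longest root-to-leaf path.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """A symbol in a toy ME tree (hypothetical structure, not the paper's)."""
    symbol: str
    # Maps an assumed relation label ("above", "below", "right") to a child node.
    children: dict = field(default_factory=dict)


def depth_first_steps(node: Node) -> int:
    """Count steps when symbols are emitted one at a time in depth-first order."""
    return 1 + sum(depth_first_steps(child) for child in node.children.values())


def branch_parallel_steps(node: Node) -> int:
    """Count steps when all open branches advance one symbol per step.

    Under this assumption the step count equals the height of the tree.
    """
    if not node.children:
        return 1
    return 1 + max(branch_parallel_steps(child) for child in node.children.values())


# Toy tree for \frac{a+b}{c+d}: the fraction bar has a numerator branch
# and a denominator branch, each a chain of three symbols.
expr = Node("\\frac", {
    "above": Node("a", {"right": Node("+", {"right": Node("b")})}),
    "below": Node("c", {"right": Node("+", {"right": Node("d")})}),
})

print("depth-first steps:", depth_first_steps(expr))
print("branch parallel steps:", branch_parallel_steps(expr))
```

The gap between the two counts grows with the number of branches that can be decoded side by side, which is consistent with the abstract's point that complex structures suffer most from depth-first decoding. Real decoders also emit structural tokens and relation predictions, which this count ignores.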