计算机科学
变压器
卷积神经网络
人工智能
提取器
自然语言处理
特征(语言学)
模式识别(心理学)
词(群论)
特征提取
语言学
工程类
哲学
电压
电气工程
工艺工程
作者
Shiwen Sun,Hongxi Wei,Yiming Wang
出处
期刊:Communications in computer and information science
日期:2023-11-30
卷期号:: 165-176
标识
DOI:10.1007/978-981-99-8178-6_13
摘要
Mongolian ancient documents are an indispensable source for studying Mongolian traditional culture. To thoroughly explore and effectively utilize these ancient documents, conducting a comprehensive study on Mongolian ancient document words recognition is essential. In order to better recognize the word images in Mongolian ancient documents, this paper proposes an approach that combines convolutional neural networks with Transformer models. The approach used in this paper takes word images as the input for the model. After passing through a feature extractor composed of convolutional neural networks, the extracted features are fed into a Transformer model for prediction. Finally, the corresponding recognition results of the word images are obtained. Due to the common existence of imbalanced distribution of character classes in recognition tasks, models often tend to excessively focus on common characters while neglecting rare characters. Our proposed approach integrates focal loss to enhance the model's attention towards rare characters, thereby improving the overall recognition performance of the model for all characters. After training, the model is capable of rapidly and efficiently performing end-to-end recognition of words in Mongolian ancient documents. The experimental results indicate that our proposed approach outperforms existing methods for word recognition in Mongolian ancient documents, effectively improving the performance of Mongolian ancient document words recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI