性格(数学)
计算机科学
跳跃式监视
光学字符识别
序列(生物学)
背景(考古学)
注释
人工智能
集合(抽象数据类型)
自然语言处理
模式识别(心理学)
语音识别
图像(数学)
数学
程序设计语言
几何学
古生物学
生物
遗传学
作者
Hoang-Quan Dang,Duy-Anh Nguyen,Phu-Phuoc Pham,Ngoc-Thinh Nguyen,Tan Chau,Duc-Vu Ngo,Trung-Hieu Nguyen,Chau-Thang Phan,The-Hien Trinh,Minh-Tri Nguyen,Trong-Hop Do
标识
DOI:10.1109/rivf55975.2022.10013842
摘要
In this article, we introduce the NomNaOCR dataset for the old Hán-Nôm script based on 3 tremendous and valuable historical works of Vietnam, including , and With 2953 handwritten Pages collected from the Vietnamese Nôm Preservation Foundation for analyzing and semi-annotating the bounding boxes to generate additional 38,318 Patches containing text along with strings in digital form. This makes NomNaOCR currently become the biggest dataset for script in Vietnam, serving 2 main problems in Optical Character Recognition: Text Detection and Text Recognition. A difference here is that our implementations were all done at the sequence level, which not only saves the annotation cost but also helps us retain the context in the sequence instead of just performing on each individual character as in most previous works. For basic results, we experimented on the validation set of NomNaOCR. By using DBNet model for Text Detection, we reached a F1-score up to 99.65%. With Text Recognition, we used CRNN model and achieved an accuracy of 29.41% at sequence level and 84.73% at character level.
科研通智能强力驱动
Strongly Powered by AbleSci AI