笔迹
计算机科学
适应(眼睛)
自然语言处理
人工智能
手写体识别
写作风格
语音识别
语言学
特征提取
心理学
哲学
神经科学
作者
Anna Scius-Bertrand,Phillip Ströbel,Martin Völk,Tobias Hodel,Andreas Fischer
标识
DOI:10.1007/978-3-031-41676-7_23
摘要
One of the main challenges of automatically transcribing large collections of handwritten letters is to cope with the high variability of writing styles present in the collection. In particular, the writing styles of non-frequent writers, who have contributed only few letters, are often missing in the annotated learning samples used for training handwriting recognition systems. In this paper, we introduce the Bullinger dataset for writer adaptation, which is based on the Heinrich Bullinger letter collection from the 16th century, using a subset of 3,622 annotated letters (about 1.2 million words) from 306 writers. We provide baseline results for handwriting recognition with modern recognizers, before and after the application of standard techniques for supervised adaptation of frequent writers and self-supervised adaptation of non-frequent writers.
科研通智能强力驱动
Strongly Powered by AbleSci AI