计算机科学
杠杆(统计)
人工智能
集合(抽象数据类型)
利用
推论
写作风格
自然语言处理
机器学习
语言学
计算机安全
哲学
程序设计语言
作者
Ayan Kumar Bhunia,Shuvozit Ghose,Amandeep Kumar,Pinaki Nath Chowdhury,Aneeshan Sain,Yi-Zhe Song
标识
DOI:10.1109/cvpr46437.2021.01557
摘要
Handwritten Text Recognition (HTR) remains a challenging problem to date, largely due to the varying writing styles that exist amongst us. Prior works however generally operate with the assumption that there is a limited number of styles, most of which have already been captured by existing datasets. In this paper, we take a completely different perspective – we work on the assumption that there is always a new style that is drastically different, and that we will only have very limited data during testing to perform adaptation. This creates a commercially viable solution – being exposed to the new style, the model has the best shot at adaptation, and the few-sample nature makes it practical to implement. We achieve this via a novel meta-learning framework which exploits additional new-writer data via a support set, and outputs a writer-adapted model via single gradient step update, all during inference (see Figure 1). We discover and leverage on the important insight that there exists few key characters per writer that exhibit relatively larger style discrepancies. For that, we additionally propose to meta-learn instance specific weights for a character-wise cross-entropy loss, which is specifically designed to work with the sequential nature of text data. Our writer-adaptive MetaHTR framework can be easily implemented on the top of most state-of-the-art HTR models. Experiments show an average performance gain of 5-7% can be obtained by observing very few new style data (≤ 16).
科研通智能强力驱动
Strongly Powered by AbleSci AI