隐藏字幕
图像(数学)
计算机科学
人工智能
领域(数学分析)
计算机视觉
数学
数学分析
作者
Xindong You,Likun Lu,Hang Zhou,Xueqiang Lv
出处
期刊:Springer eBooks
[Springer Nature]
日期:2021-01-01
卷期号:: 108-119
标识
DOI:10.1007/978-3-030-77428-8_9
摘要
Although image captioning technology has made great progress in recent years, the quality of Chinese image description is far from enough. In this paper, we focus on the problem of Chinese image captioning with the aim to improve the quality of Chinese image description. A novel framework for Chinese image captioning based on image label information (CIC) is proposed in this paper. Firstly, image label information is extracted by a multi-layer model with shortcut connections. Then the label information is input into the neural network with an extension of LSTM, which we coin L-LSTM for short, to generate the Chinese image descriptions. Extensive experiments are conducted on various image caption datasets such as Flickr8k-cn, Flickr30 k-cn. The experimental results verify the effectiveness of the proposed framework (CIC). It obtains 27.1% and 21.2% BLEU4 average values of Flickr8k-cn and Flickr30k-cn, respectively, which outperforms the state-of-art model in Chinese image captioning domain.
科研通智能强力驱动
Strongly Powered by AbleSci AI