任务(项目管理)
图像(数学)
计算机科学
人气
人工智能
编码器
计算机视觉
情报检索
心理学
工程类
社会心理学
系统工程
操作系统
作者
Xinghui Song,Peipei Zhu
出处
期刊:Lecture notes in electrical engineering
日期:2024-01-01
卷期号:: 41-51
标识
DOI:10.1007/978-981-99-9239-3_4
摘要
In recent decades, the confluence of CV and NLP technologies has grown in popularity. Many researchers have focused their attention on Image caption task. In recent years, academics have been more interested in image aesthetic description because of image aesthetic indicative of the level. In this study, we present an aesthetic description technique that combines image description and aesthetic description at the same time. We use a Siamese network to acquire datasets for training from two data domains: Image caption task and Image aesthetic description task. The parameters gained from training were migrated back to the conventional Encoder-Decoder model for testing after training. On image caption task, we chose the flickr8k datasets to reduce computing cost. On aesthetic task, the PCCD datasets was used. The final findings indicate that our technique is capable of simultaneously training datasets from two data domains and producing both kinds of image descriptions.
科研通智能强力驱动
Strongly Powered by AbleSci AI