情态动词
接地
计算机科学
比例(比率)
图形
理论计算机科学
物理
工程类
材料科学
电气工程
量子力学
高分子化学
作者
Jingping Liu,Mingchuan Zhang,Weichen Li,Chao Wang,Shuang Li,Haiyun Jiang,Sihang Jiang,Yanghua Xiao,Yunwen Chen
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence
[Association for the Advancement of Artificial Intelligence (AAAI)]
日期:2024-03-24
卷期号:38 (17): 18653-18661
被引量:2
标识
DOI:10.1609/aaai.v38i17.29828
摘要
Much effort has been devoted to building multi-modal knowledge graphs by visualizing entities on images, but ignoring the multi-modal information of the relation between entities. Hence, in this paper, we aim to construct a new large-scale multi-modal knowledge graph with triplet facts grounded on images that reflect not only entities but also their relations. To achieve this purpose, we propose a novel pipeline method, including triplet fact filtering, image retrieving, entity-based image filtering, relation-based image filtering, and image clustering. In this way, a multi-modal knowledge graph named ImgFact is constructed, which contains 247,732 triplet facts and 3,730,805 images. In experiments, the manual and automatic evaluations prove the reliable quality of our ImgFact. We further use the obtained images to enhance model performance on two tasks. In particular, the model optimized by our ImgFact achieves an impressive 8.38% and 9.87% improvement over the solutions enhanced by an existing multi-modal knowledge graph and VisualChatGPT on F1 of relation classification. We release ImgFact and its instructions at https://github.com/kleinercubs/ImgFact.
科研通智能强力驱动
Strongly Powered by AbleSci AI