计算机科学
情态动词
情报检索
亲密度
分类
典型相关
预处理器
模态(人机交互)
人工智能
数据挖掘
数学
数学分析
化学
高分子化学
作者
Susanta Malik,Nikhil Bhardwaj,Rahul Bhardwaj,Satish Kumar
出处
期刊:Lecture notes in networks and systems
日期:2022-11-10
卷期号:: 725-734
标识
DOI:10.1007/978-981-19-3148-2_62
摘要
AbstractCross-modal retrieval intends to empower adaptable recovery across various modalities. The center of cross-modal retrieval is the manner by which to quantify the substance similitude between various sorts of information. In this work, we deal with a cross-modal retrieval technique, called Canonical Correlation Analysis (CCA). It accepts one sort of information as the question to recover pertinent information of another sort. The given indexed lists across different modalities can be useful to the clients to get exhaustive data about the objective occasions or points. With the quick development of various kinds of media information like texts, pictures, and recordings on the Internet, cross-modal retrieval turns out to be progressively significant in true applications. As of late, cross-modal retrieval has drawn in the significant consideration of the analysts from both scholarly communities also, industry. The test of cross-modal retrieval is the ticket to gauge the substance closeness between various kinds of information since they, which is alluded to as the heterogeneity hole. After data preprocessing and learning the mappings in the same space, we will try to find out the most similar samples based on pre calculated features of the samples in a given format. We will keep the features learned by VGG-16 precalculated and the features learned by text model would then be used to search for the most similar image that best explains the caption.KeywordsCross-modal retrievalVGG-16ModalitiesCanonical Correlation Analysis (CCA)Neural network
科研通智能强力驱动
Strongly Powered by AbleSci AI