Instance-Level Semantic Alignment for Zero-Shot Cross-Modal Retrieval

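Keywords: Computer science; Modal verb; Artificial intelligence; Invariant (physics); Semantics (computer science); Generator (circuit theory); Class (philosophy); Pattern; Natural language processing; Pattern recognition (psychology); Mathematics; Power (physics); Programming language; Polymer chemistry; Chemistry; Sociology; Physics; Quantum mechanics; Mathematical physics; Social science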

Authors

Kai Wang, Yifan Wang, Xing Xu, Zhiwei Cao, Xunliang Cai

Identifier

DOI: 10.1109/icme52920.2022.9860026

Abstract

Zero-shot cross-modal retrieval (ZS-CMR) is challenging because data distributions are heterogeneous across modalities and semantics are inconsistent between seen and unseen classes. Previous methods usually perform class-level semantic alignment of data from different modalities by introducing auxiliary word embeddings of class labels. This has a critical limitation: learning class-level information leads the model to ignore intra-modal variance. To solve this problem, we propose the Instance-Level Semantic Alignment (ILSA) method, which makes full use of instance-level information. We use two disentangling variational auto-encoders to decompose the data from the two modalities into modality-specific and modality-invariant features. With an instance-level semantic feature extractor and a distribution generator, ILSA can generate more appropriate distributions from the learned instance-level semantic features, without any auxiliary knowledge. We run experiments on six widely used datasets under two ZS-CMR scenarios; the results show that our method establishes new state-of-the-art performance on all datasets.
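The decomposition step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no architecture or loss details, so the single linear encoder head, the latent sizes, the image/text pairing, the KL term against a standard normal prior, and the squared-distance "alignment gap" are all assumptions chosen only to show how one encoder per modality might split its latent code into a modality-invariant part and a modality-specific part. All class and function names are hypothetical.

```python
import math
import random


def linear(x, weights, bias):
    """Apply y = W x + b, with W given as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_linear(rng, in_dim, out_dim):
    """Small random initialisation for a linear layer (illustrative only)."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(in_dim)]
               for _ in range(out_dim)]
    return weights, [0.0] * out_dim


def reparameterize(rng, mu, logvar):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, logvar)]


def kl_to_standard_normal(mu, logvar):
    """KL(N(mu, sigma^2) || N(0, I)), summed over latent dimensions."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv)
                      for m, lv in zip(mu, logvar))


class DisentanglingEncoder:
    """Hypothetical encoder that emits two latent codes per input."""

    def __init__(self, rng, in_dim, inv_dim, spec_dim):
        self.rng = rng
        self.inv_dim = inv_dim
        self.spec_dim = spec_dim
        # One head outputs means and log-variances for both latent codes.
        self.weights, self.bias = init_linear(
            rng, in_dim, 2 * (inv_dim + spec_dim))

    def encode(self, x):
        out = linear(x, self.weights, self.bias)
        k, s = self.inv_dim, self.spec_dim
        inv_mu, inv_logvar = out[:k], out[k:2 * k]
        spec_mu, spec_logvar = out[2 * k:2 * k + s], out[2 * k + s:]
        z_inv = reparameterize(self.rng, inv_mu, inv_logvar)
        z_spec = reparameterize(self.rng, spec_mu, spec_logvar)
        kl = (kl_to_standard_normal(inv_mu, inv_logvar)
              + kl_to_standard_normal(spec_mu, spec_logvar))
        return z_inv, z_spec, kl


rng = random.Random(0)

# Assumed modalities and feature sizes; the invariant size is shared so the
# two invariant codes live in the same space, the specific sizes need not be.
image_encoder = DisentanglingEncoder(rng, in_dim=8, inv_dim=3, spec_dim=2)
text_encoder = DisentanglingEncoder(rng, in_dim=5, inv_dim=3, spec_dim=2)

image_feature = [rng.random() for _ in range(8)]
text_feature = [rng.random() for _ in range(5)]

img_inv, img_spec, img_kl = image_encoder.encode(image_feature)
txt_inv, txt_spec, txt_kl = text_encoder.encode(text_feature)

# Only the invariant codes are compared across modalities in this sketch.
alignment_gap = sum((a - b) ** 2 for a, b in zip(img_inv, txt_inv))
```

In this sketch only the modality-invariant codes are compared across modalities, while the modality-specific codes stay private to each encoder. How ILSA actually generates and aligns instance-level distributions is not specified in the abstract and is left out here.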



