
Improving Cross-Modal Image-Text Retrieval With Teacher-Student Learning

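Keywords: Computer science; Benchmark (surveying); Image (mathematics); Modal verb; Artificial intelligence; Set (abstract data type); Image retrieval; Mode; Information retrieval; Pattern recognition (psychology); Sociology; Chemistry; Social science; Polymer chemistry; Programming language; Geography; Geodesy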

Authors

Junhao Liu, Min Yang, Chengming Li, Ruifeng Xu

Source

Journal: IEEE Transactions on Circuits and Systems for Video Technology [Institute of Electrical and Electronics Engineers]
Date: 2021-08-01; Volume (issue): 31 (8): 3242-3253; Citations: 26

Identifier

DOI: 10.1109/tcsvt.2020.3037661

Abstract

Cross-modal image-text retrieval has emerged as a challenging task that requires the multimedia system to bridge the heterogeneity gap between different modalities. In this paper, we take full advantage of image-to-text and text-to-image generation models to improve the performance of the cross-modal image-text retrieval model by incorporating the text-grounded and image-grounded generative features into the cross-modal common space with a “Two-Teacher One-Student” learning framework. In addition, a dual regularizer network is designed to distinguish the mismatched image-text pairs from the matched ones. In this way, we can capture the fine-grained correspondence between modalities and distinguish the best-retrieved result from a candidate set. Extensive experiments on three benchmark datasets (i.e., MIRFLICKR-25K, NUS-WIDE, and MS COCO) show that our model can achieve state-of-the-art cross-modal retrieval results. In particular, our model improves the image-to-text and text-to-image retrieval accuracy by more than 22% over the best competitors on the MS COCO dataset.
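The abstract does not give the loss functions, so the following is only a minimal sketch of the general idea it describes: a student retrieval model trained in a common space while being pulled toward features from two teachers. Everything here is an assumption for illustration, not the authors' method: the hinge ranking loss, the mean-squared-error distillation term, the weight `alpha`, the margin, and all function and parameter names (`teacher_a`, `teacher_b`, and so on) are hypothetical, and the dual regularizer network is not modeled.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def ranking_loss(img, txt_pos, txt_neg, margin=0.2):
    """Hinge triplet loss (assumed): the matched pair should outscore
    the mismatched pair by at least `margin`."""
    return max(0.0, margin - cosine(img, txt_pos) + cosine(img, txt_neg))

def distill_loss(student, teacher):
    """Mean squared error between a student embedding and a teacher feature."""
    return sum((s - t) ** 2 for s, t in zip(student, teacher)) / len(student)

def total_loss(img, txt_pos, txt_neg, teacher_a, teacher_b, alpha=0.5):
    """Retrieval loss plus two hypothetical distillation terms, one per teacher.
    Which teacher supervises which embedding is an assumption of this sketch."""
    retrieval = ranking_loss(img, txt_pos, txt_neg)
    distill = distill_loss(img, teacher_a) + distill_loss(txt_pos, teacher_b)
    return retrieval + alpha * distill

def rank_candidates(query, candidates):
    """Return candidate indices sorted by descending cosine similarity."""
    return sorted(range(len(candidates)),
                  key=lambda i: cosine(query, candidates[i]),
                  reverse=True)

# Toy 2-D embeddings in the shared space (made-up values).
image = [1.0, 0.0]
matched_text = [1.0, 0.0]
mismatched_text = [0.0, 1.0]
order = rank_candidates(image, [mismatched_text, matched_text])
```

At retrieval time only the student embeddings are needed: `rank_candidates` orders a candidate set by similarity to the query, which is how the best-retrieved result would be picked in this sketch.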
