Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark

计算机科学 鉴定(生物学) 人工智能 水准点(测量) 情态动词 图像(数学) 集合(抽象数据类型) 秩(图论) 判决 比例(比率) 计算机视觉 模式识别(心理学) 机器学习 物理 组合数学 生物 量子力学 化学 植物 高分子化学 程序设计语言 地理 数学 大地测量学
作者
Leqi Ding,Lei Liu,Yan Huang,Chenglong Li,Cheng Zhang,Sheng Wang,Liang Wang
出处
期刊:IEEE Transactions on Intelligent Transportation Systems [Institute of Electrical and Electronics Engineers]
卷期号:25 (7): 7673-7686 被引量:5
标识
DOI:10.1109/tits.2023.3348599
摘要

Vehicle Re-IDentification (Re-ID) aims to retrieve the most similar images with a given query vehicle image from a set of images captured by non-overlapping cameras, and plays a crucial role in intelligent transportation systems and has made impressive advancements in recent years. In real-world scenarios, we can often acquire the text descriptions of target vehicle through witness accounts, and then manually search the image queries for vehicle Re-ID, which is time-consuming and labor-intensive. To solve this problem, this paper introduces a new fine-grained cross-modal retrieval task called text-to-image vehicle re-identification, which seeks to retrieve target vehicle images based on the given text descriptions. To bridge the significant gap between language and visual modalities, we propose a novel Multi-scale multi-view Cross-modal Alignment Network (MCANet). In particular, we incorporate view masks and multi-scale features to align image and text features in a progressive way. In addition, we design the Masked Bidirectional InfoNCE (MB-InfoNCE) loss to enhance the training stability and make the best use of negative samples. To provide an evaluation platform for text-to-image vehicle re-identification, we create a Text-to-Image Vehicle Re-Identification dataset (T2I VeRi), which contains 2465 image-text pairs from 776 vehicles with an average sentence length of 26.8 words. Extensive experiments conducted on T2I VeRi demonstrate MCANet outperforms the current state-of-art (SOTA) method by 2.2% in rank-1 accuracy.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
徐哗啦完成签到,获得积分10
3秒前
李婉莹李婉莹完成签到,获得积分20
3秒前
67完成签到 ,获得积分10
4秒前
4秒前
5秒前
6秒前
浅尝离白完成签到,获得积分0
6秒前
丘比特应助ay采纳,获得10
7秒前
fanglin123完成签到,获得积分10
8秒前
9秒前
跳跳熊完成签到,获得积分10
10秒前
10秒前
思源应助要减肥天问采纳,获得10
10秒前
幸福岩发布了新的文献求助30
11秒前
11秒前
12秒前
寒冷孤风完成签到,获得积分10
14秒前
Steven发布了新的文献求助10
15秒前
JamesPei应助笑点低代萱采纳,获得10
16秒前
16秒前
果果发布了新的文献求助10
17秒前
paper完成签到 ,获得积分10
18秒前
18秒前
22秒前
温暖白柏发布了新的文献求助10
24秒前
伶俐的如松完成签到,获得积分10
25秒前
26秒前
Steven发布了新的文献求助10
26秒前
27秒前
27秒前
Liufgui应助林洁佳采纳,获得30
28秒前
你吼发布了新的文献求助10
28秒前
徐嘿嘿完成签到,获得积分20
29秒前
Bown完成签到 ,获得积分10
29秒前
sky发布了新的文献求助20
30秒前
100完成签到,获得积分10
30秒前
柒柒球发布了新的文献求助10
32秒前
32秒前
hzh完成签到 ,获得积分10
33秒前
高分求助中
The Mother of All Tableaux: Order, Equivalence, and Geometry in the Large-scale Structure of Optimality Theory 3000
A new approach to the extrapolation of accelerated life test data 1000
Problems of point-blast theory 400
北师大毕业论文 基于可调谐半导体激光吸收光谱技术泄漏气体检测系统的研究 390
Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 370
Robot-supported joining of reinforcement textiles with one-sided sewing heads 320
Novel Preparation of Chitin Nanocrystals by H2SO4 and H3PO4 Hydrolysis Followed by High-Pressure Water Jet Treatments 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 3998871
求助须知:如何正确求助?哪些是违规求助? 3538355
关于积分的说明 11273977
捐赠科研通 3277299
什么是DOI,文献DOI怎么找? 1807509
邀请新用户注册赠送积分活动 883909
科研通“疑难数据库(出版商)”最低求助积分说明 810075