Multidirection and Multiscale Pyramid in Transformer for Video-Based Pedestrian Retrieval

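Keywords: Computer science; Pyramid (geometry); Transformer; Artificial intelligence; Computer vision; Pedestrian; Feature extraction; Feature (linguistics); Pattern recognition (psychology); Engineering; Voltage; Mathematics; Linguistics; Philosophy; Geometry; Electrical engineering; Transportation engineering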

Authors

Xianghao Zang, Ge Li, Wei Gao

Source

Journal: IEEE Transactions on Industrial Informatics [Institute of Electrical and Electronics Engineers]
Date: 2022-02-15 | Volume (issue): 18 (12), pages 8776-8785 | Citations: 55

Identifier

DOI: 10.1109/tii.2022.3151766

Abstract

In video surveillance, pedestrian retrieval (also called person re-identification) is a critical task. This task aims to retrieve the pedestrian of interest from non-overlapping cameras. Recently, transformer-based models have achieved significant progress for this task. However, these models still suffer from ignoring fine-grained, part-informed information. This paper proposes a multi-direction and multi-scale Pyramid in Transformer (PiT) to solve this problem. In transformer-based architecture, each pedestrian image is split into many patches. Then, these patches are fed to transformer layers to obtain the feature representation of this image. To explore the fine-grained information, this paper proposes to apply vertical division and horizontal division on these patches to generate different-direction human parts. These parts provide more fine-grained information. To fuse multi-scale feature representation, this paper presents a pyramid structure containing global-level information and many pieces of local-level information from different scales. The feature pyramids of all the pedestrian images from the same video are fused to form the final multi-direction and multi-scale feature representation. Experimental results on two challenging video-based benchmarks, MARS and iLIDS-VID, show the proposed PiT achieves state-of-the-art performance. Extensive ablation studies demonstrate the superiority of the proposed pyramid structure. The code is available at https://git.openi.org.cn/zangxh/PiT.git.
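The division-and-pyramid idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the scales (2, 3), mean pooling as the part descriptor, and frame averaging as the fusion step are all assumptions chosen for illustration. It only shows how a grid of patch tokens might be cut into parts in two directions, stacked with a global feature, and fused across the frames of one video.

```python
from statistics import fmean
import random

def mean_pool(tokens):
    """Average a sequence of equal-length vectors into one vector."""
    return [fmean(col) for col in zip(*tokens)]

def split_ranges(n, k):
    """Split range(n) into k contiguous, near-equal index ranges."""
    bounds = [i * n // k for i in range(k + 1)]
    return [range(bounds[i], bounds[i + 1]) for i in range(k)]

def frame_pyramid(grid, scales=(2, 3)):
    """grid[r][c] is the token vector of the patch at row r, column c.

    Returns the global feature followed by part features in both
    directions at every scale (hypothetical scales, mean pooling assumed).
    """
    rows, cols = len(grid), len(grid[0])
    feats = [mean_pool([grid[r][c] for r in range(rows) for c in range(cols)])]
    for k in scales:
        for rs in split_ranges(rows, k):  # parts stacked top to bottom
            feats.append(mean_pool([grid[r][c] for r in rs for c in range(cols)]))
        for cs in split_ranges(cols, k):  # parts placed side by side
            feats.append(mean_pool([grid[r][c] for r in range(rows) for c in cs]))
    return feats

def video_pyramid(frames, scales=(2, 3)):
    """Fuse per-frame pyramids by averaging each level over frames (assumed)."""
    per_frame = [frame_pyramid(grid, scales) for grid in frames]
    return [mean_pool(level) for level in zip(*per_frame)]

# Toy input: 3 frames, a 12x6 patch grid per frame, 4-dimensional tokens.
random.seed(0)
video = [[[[random.random() for _ in range(4)] for _ in range(6)]
          for _ in range(12)] for _ in range(3)]
pyramid = video_pyramid(video)
print(len(pyramid), len(pyramid[0]))
```

With scales (2, 3) the pyramid holds one global vector plus 2 + 2 + 3 + 3 part vectors, so 11 levels in total. In a real model the patch tokens would come from transformer layers rather than random numbers, and the part descriptor would likely be learned rather than a plain mean.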
