Keywords
Pascal (unit)
Convolutional neural network
Artificial intelligence
Computer science
Transformer
Pattern recognition (psychology)
Image (mathematics)
Cognitive neuroscience of visual object recognition
Action recognition
Computer vision
Machine learning
Object (grammar)
Engineering
Class (philosophy)
Voltage
Electrical engineering
Programming language
Authors
Seyed Rohollah Hosseyni, Hasan Taheri, Sanaz Seyedin, Ali Ahmad Rahmani
Source
Journal: Cornell University - arXiv
Date: 2023-01-01
Identifier
DOI: 10.48550/arxiv.2307.08994
Abstract
Understanding the relationship between different parts of an image is crucial in a variety of applications, including object recognition, scene understanding, and image classification. Although Convolutional Neural Networks (CNNs) have demonstrated impressive results in classifying and detecting objects, they lack the capability to extract the relationship between different parts of an image, which is a crucial factor in Human Action Recognition (HAR). To address this problem, this paper proposes a new module that functions like a convolutional layer built on a Vision Transformer (ViT). In the proposed model, the Vision Transformer can complement a convolutional neural network in a variety of tasks by helping it to effectively extract the relationships among various parts of an image. It is shown that the proposed model, compared to a simple CNN, can extract meaningful parts of an image and suppress the misleading parts. The proposed model has been evaluated on the Stanford40 and PASCAL VOC 2012 action datasets, achieving 95.5% and 91.5% mean Average Precision (mAP), respectively, which is promising compared to other state-of-the-art methods.
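The abstract's core idea is a ViT-based module that behaves like a convolutional layer: the image is split into patches, and self-attention lets every patch aggregate information from every other patch, capturing the part-to-part relationships a plain convolution misses. The following is a minimal NumPy sketch of that general mechanism, not the authors' implementation; the function name, patch size, embedding dimension, and the untrained random projection matrices are all illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def vit_like_layer(image, patch=8, dim=32, seed=0):
    """Self-attention over image patches, returning a spatial feature map
    shaped like a convolutional layer's output: (H/patch, W/patch, dim).

    Hypothetical sketch: the projections below are random stand-ins for
    the learned weights a real ViT module would train.
    """
    rng = np.random.default_rng(seed)
    H, W, C = image.shape
    gh, gw = H // patch, W // patch
    # Split the image into non-overlapping patches and flatten each one
    patches = image[:gh * patch, :gw * patch].reshape(gh, patch, gw, patch, C)
    tokens = patches.transpose(0, 2, 1, 3, 4).reshape(gh * gw, patch * patch * C)
    # Random (untrained) projections in place of learned embeddings
    W_embed = rng.normal(0, 0.02, (tokens.shape[1], dim))
    Wq, Wk, Wv = (rng.normal(0, 0.02, (dim, dim)) for _ in range(3))
    x = tokens @ W_embed
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    # Attention matrix holds pairwise relations between all patches
    attn = softmax(q @ k.T / np.sqrt(dim))
    out = attn @ v  # each patch aggregates information from every other patch
    return out.reshape(gh, gw, dim)

feat = vit_like_layer(np.ones((64, 64, 3)))
print(feat.shape)  # (8, 8, 32)
```

Because the output keeps a spatial grid layout, such a module could in principle be dropped into a CNN in place of (or alongside) a convolutional layer, which is the complementary role the abstract describes.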