亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Fine-Grained Video Captioning via Graph-based Multi-Granularity Interaction Learning

计算机科学 隐藏字幕 叙述的 粒度 任务(项目管理) 人工智能 自然语言处理 人机交互 多媒体 语言学 操作系统 图像(数学) 哲学 经济 管理
作者
Yichao Yan,Ning Zhuang,Bingbing Ni,Jian Zhang,Minghao Xu,Qiang Zhang,Zheng Zhang,Shuo Cheng,Qi Tian,Yi Xu,Xiaokang Yang,Wenjun Zhang
出处
期刊:IEEE Transactions on Pattern Analysis and Machine Intelligence [IEEE Computer Society]
卷期号:44 (2): 666-683 被引量:18
标识
DOI:10.1109/tpami.2019.2946823
摘要

Learning to generate continuous linguistic descriptions for multi-subject interactive videos in great details has particular applications in team sports auto-narrative. In contrast to traditional video caption, this task is more challenging as it requires simultaneous modeling of fine-grained individual actions, uncovering of spatio-temporal dependency structures of frequent group interactions, and then accurate mapping of these complex interaction details into long and detailed commentary. To explicitly address these challenges, we propose a novel framework Graph-based Learning for Multi-Granularity Interaction Representation (GLMGIR) for fine-grained team sports auto-narrative task. A multi-granular interaction modeling module is proposed to extract among-subjects' interactive actions in a progressive way for encoding both intra- and inter-team interactions. Based on the above multi-granular representations, a multi-granular attention module is developed to consider action/event descriptions of multiple spatio-temporal resolutions. Both modules are integrated seamlessly and work in a collaborative way to generate the final narrative. In the meantime, to facilitate reproducible research, we collect a new video dataset from YouTube.com called Sports Video Narrative dataset (SVN). It is a novel direction as it contains 6K team sports videos (i.e., NBA basketball games) with 10K ground-truth narratives(e.g., sentences). Furthermore, as previous metrics such as METEOR (i.e., used in coarse-grained video caption task) DO NOT cope with fine-grained sports narrative task well, we hence develop a novel evaluation metric named Fine-grained Captioning Evaluation (FCE), which measures how accurate the generated linguistic description reflects fine-grained action details as well as the overall spatio-temporal interactional structure. Extensive experiments on our SVN dataset have demonstrated the effectiveness of the proposed framework for fine-grained team sports video auto-narrative.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI6.1应助zxh采纳,获得10
9秒前
12秒前
lqkcqmu发布了新的文献求助10
17秒前
ferritin完成签到 ,获得积分10
18秒前
20秒前
24秒前
lqkcqmu发布了新的文献求助30
27秒前
zxh发布了新的文献求助10
31秒前
32秒前
35秒前
lqkcqmu发布了新的文献求助10
37秒前
孤独黑猫完成签到 ,获得积分0
39秒前
43秒前
lqkcqmu发布了新的文献求助30
48秒前
zxh完成签到,获得积分10
49秒前
吴桂学完成签到 ,获得积分10
51秒前
53秒前
lqkcqmu发布了新的文献求助10
58秒前
1分钟前
lqkcqmu发布了新的文献求助10
1分钟前
1分钟前
1分钟前
1分钟前
聂_发布了新的文献求助10
1分钟前
lqkcqmu发布了新的文献求助10
1分钟前
1分钟前
chiien完成签到 ,获得积分10
1分钟前
1分钟前
lucky发布了新的文献求助10
1分钟前
1分钟前
1分钟前
李爱国应助科研通管家采纳,获得10
1分钟前
1分钟前
xxj完成签到 ,获得积分10
1分钟前
李健应助xintai采纳,获得10
1分钟前
lqkcqmu发布了新的文献求助10
1分钟前
Gagaga发布了新的文献求助10
1分钟前
1分钟前
何何发布了新的文献求助10
1分钟前
1分钟前
高分求助中
The Wiley Blackwell Companion to Diachronic and Historical Linguistics 3000
Standards for Molecular Testing for Red Cell, Platelet, and Neutrophil Antigens, 7th edition 1000
HANDBOOK OF CHEMISTRY AND PHYSICS 106th edition 1000
ASPEN Adult Nutrition Support Core Curriculum, Fourth Edition 1000
Signals, Systems, and Signal Processing 610
脑电大模型与情感脑机接口研究--郑伟龙 500
GMP in Practice: Regulatory Expectations for the Pharmaceutical Industry 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6291600
求助须知:如何正确求助?哪些是违规求助? 8109634
关于积分的说明 16967086
捐赠科研通 5355318
什么是DOI,文献DOI怎么找? 2845657
邀请新用户注册赠送积分活动 1823020
关于科研通互助平台的介绍 1678538