
Multimodal Co-attention Transformer for Video-Based Personality Understanding

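Interpretability, Modality, Computer science, Artificial intelligence, Personality, Machine learning, Visualization, Multimedia, Psychology, Social psychology, Social science, Sociology
Authors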
Mingwei Sun, Kunpeng Zhang
Identifier
DOI:10.1109/bigdata59044.2023.10386376
Abstract

Video has emerged as a pervasive medium for communication, entertainment, and information sharing. With the consumption of video content continuing to increase rapidly, understanding the impact of visual narratives on personality has become a crucial area of research. While text-based personality understanding has been extensively studied in the literature, video-based personality prediction remains relatively under-explored. Existing approaches to video-based personality prediction can be broadly categorized into two directions: learning a joint representation of audio and visual information using fully-connected feed-forward networks, and separating a video into its individual modalities (text, image, and audio), training each modality independently, and then ensembling the results for subsequent personality prediction. However, both approaches have notable limitations: ignoring complex interactions between visual and audio components, or considering all three modalities but not in a joint manner. Furthermore, all methods require high computational costs as they require high-resolution images to train. In this paper, we propose a novel Multimodal Co-attention Transformer neural network for video-based affect prediction. Our approach simultaneously models audio, visual, and text representations, as well as their inter-relations, to achieve accurate and efficient predictions. We demonstrate the effectiveness of our method via extensive experiments on a real-world dataset: First Impressions. Our results show that the proposed model outperforms state-of-the-art approaches while maintaining high computational efficiency. In addition to our performance evaluation, we also conduct interpretability analyses to investigate the contribution across different levels. Our findings reveal valuable insights into personality predictions. The implementation is available at: https://github.com/nestor-sun/mcoattention.
