情态动词
讽刺
计算机科学
模式
模态(人机交互)
人工智能
语言学
社会学
讽刺
哲学
高分子化学
社会科学
化学
作者
Bin Liang,Chenwei Lou,Xiang Li,Lin Gui,Min Yang,Ruifeng Xu
出处
期刊:ACM Multimedia
日期:2021-10-17
被引量:35
标识
DOI:10.1145/3474085.3475190
摘要
Sarcasm is a peculiar form and sophisticated linguistic act to express the incongruity of someone's implied sentiment expression, which is a pervasive phenomenon in social media platforms. Compared with sarcasm detection purely on texts, multi-modal sarcasm detection is more adapted to the rapidly growing social media platforms, where people are interested in creating multi-modal messages. When focusing on the multi-modal sarcasm detection for tweets consisting of texts and images on Twitter, the significant clue of improving the performance of multi-modal sarcasm detection evolves into how to determine the incongruity relations between texts and images. In this paper, we investigate multi-modal sarcasm detection from a novel perspective, so as to determine the sentiment inconsistencies within a certain modality and across different modalities by constructing heterogeneous in-modal and cross-modal graphs (InCrossMGs) for each multi-modal example. Based on it, we explore an interactive graph convolution network (GCN) structure to jointly and interactively learn the incongruity relations of in-modal and cross-modal graphs for determining the significant clues in sarcasm detection. Experimental results demonstrate that our proposed model achieves state-of-the-art performance in multi-modal sarcasm detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI