讽刺
计算机科学
情态动词
人工智能
自然语言处理
语义学(计算机科学)
卷积神经网络
图形
背景(考古学)
语言学
理论计算机科学
讽刺
古生物学
化学
高分子化学
程序设计语言
哲学
生物
作者
Lingshan Li,Di Jin,Xiaobao Wang,Fan Guo,Longbiao Wang,Jianwu Dang
标识
DOI:10.1109/ictai59109.2023.00138
摘要
Sarcasm, a linguistic technique employed to express emotions opposite to their literal meaning, has garnered significant attention from researchers due to the rise of social media. Detecting sarcasm in a multi-modal context has become a focal point in recent studies. However, existing research primarily relies on identifying inconsistencies between text semantics and image semantics, often lacking a deep understanding of images. Consequently, capturing inconsistencies between images and texts poses a challenge in many cases. In this paper, we propose the Entity-Relational Graph Convolutional Network (ERGCN) as a solution to detect sarcasm by examining the relationship between entities within images. Our approach involves extracting entities and text descriptions from each image, which provides valuable entity information. Subsequently, we employ external knowledge to construct a cross-modal graph for each text and image pair, emphasizing the presence of internal contradictory information. Finally, we utilize the graph convolutional network to identify inconsistent information across modalities and successfully detect sarcasm. Experimental results demonstrate that our model achieves state-of-the-art performance on a widely used multimodal Twitter dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI