对话
计算机科学
对话框
编码
有向无环图
有向图
自然语言处理
人工智能
图形
语音识别
理论计算机科学
算法
心理学
沟通
生物化学
化学
万维网
基因
作者
Xiaojun Quan,Siyue Wu,Junqing Chen,Weizhou Shen,Jianxing Yu
出处
期刊:IEEE Transactions on Affective Computing
[Institute of Electrical and Electronics Engineers]
日期:2023-05-08
卷期号:: 1-17
被引量:1
标识
DOI:10.1109/taffc.2023.3273589
摘要
Multi-party conversation modeling plays a vital role in emotion recognition in conversation (ERC). Aside from the intra- and inter-speaker dependencies between different speakers, the difficulty also lies in the fact that each conversation may contain several to many utterances that compose a long text sequence. In this article, we present two approaches to effective multi-party conversation modeling. First, to encode long sequences and capture long-range dependency between utterances, we introduce a dialog-oriented language model, DialogXL, with enhanced memory to store longer conversation sequences and dialog-aware self-attention to deal with multi-party dependencies. Second, we present a directed acyclic neural network, namely DAG-ERC, to encode the utterances with a directed acyclic graph (DAG) to better capture the intrinsic structure within a conversation. DAG-ERC combines the advantages of recurrent models and graph models and provides a more intuitive way to model information flow between sequential utterances. Extensive experiments are conducted on four ERC benchmarks with state-of-the-art models employed for comparison, and empirical results demonstrate the superiority of the two models in multi-party conversation modeling.
科研通智能强力驱动
Strongly Powered by AbleSci AI