自动汇总
计算机科学
工作流程
突出
背景(考古学)
自然语言处理
人工智能
情报检索
限制
阶段(地层学)
方案(数学)
数据挖掘
数据库
机械工程
古生物学
数学分析
数学
工程类
生物
作者
Jiaxin Duan,Fengyu Lu,Junfei Liu
标识
DOI:10.1109/bibm58861.2023.10385437
摘要
Medical dialogue summarization (MDS) refers to automatically generating electronic health records (EHRs) from doctor-patient dialogues to relieve doctors from the recording burden. The long-term nature of medical dialogue makes it more efficient to handle MDS with a two-stage abstractive summarization model, which first extracts salient content from the source text and then generates an abstract summary based on that. However, this commonly used extractive-abstractive paradigm struggles to identify absolute salient statements in the first stage while discards all information mistakenly considered unimportant, heavily limiting the second-stage summarization performance. In this paper, we introduce a novel two-stage model for MDS with a compact-then-abstract workflow to ensure data efficiency and information integrity. After predicting EHR-related utterances in a medical dialogue, our model adaptively compresses the remaining into a soft context and generates an EHR according to both the prediction and compression results. This way, we loosen the traditional first-stage extraction to a hybrid of extraction and compression, which makes the input context compact and avoids suffering from losing information for producing an EHR. Extensive experiments on two public datasets show that the proposed model significantly outperforms the state-of-the-art counterparts w.r.t. multiple metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI