自动汇总
计算机科学
判决
突出
情报检索
图形
多文档摘要
节点(物理)
自然语言处理
人工神经网络
主题(计算)
人工智能
万维网
理论计算机科学
结构工程
工程类
作者
Zhen Zhang,Wan Soo Yun,Xiyuan Jia,Qiujian Lv,Hao Ni,Xin Wang,Guohua Wu
出处
期刊:Communications in computer and information science
日期:2023-11-26
卷期号:: 136-149
标识
DOI:10.1007/978-981-99-8138-0_12
摘要
Extractive text summarization aims to select salient sentences from documents. However, most existing extractive methods struggle to capture inter-sentence relations in long documents. In addition, the hierarchical structure information of the document is ignored. For example, some scientific documents have fixed chapters, and sentences in the same chapter have the same theme. To solve these problems, this paper proposes a Fusion Hierarchical Structure Information Graph Neural Network for Extractive Long Documents Summarization. The model constructs a section node containing sentence nodes and global information according to the document structure. It integrates the hierarchical structure information of the text and uses position information to identify sentences. The section node acts as an intermediary node for information interaction between sentences, which better enriches the relationships between sentences and has higher computational efficiency. Our model has achieved excellent results on two datasets, PubMed and arXiv. Further analysis shows that the hierarchical structure information of documents helps the model select salient content better.
科研通智能强力驱动
Strongly Powered by AbleSci AI