正确性
计算机科学
自动汇总
可读性
冗余(工程)
自然语言处理
人工智能
判决
元组
强化学习
解析
信息抽取
语法性
机器学习
情报检索
程序设计语言
语法
语言学
数学
离散数学
操作系统
哲学
作者
Mengli Zhang,Gang Zhou,Wanting Yu,Wenfen Liu
标识
DOI:10.1016/j.ipm.2020.102478
摘要
Automatic summarization systems provide an effective solution to today's unprecedented growth of textual data. For real-world tasks, such as data mining and information retrieval, the factual correctness of generated summary is critical. However, existing models usually focus on improving the informativeness rather than optimizing factual correctness. In this work, we present a Fact-Aware Reinforced Abstractive Sentence Summarization framework to improve the factual correctness of neural abstractive summarization models, denoted as FAR-ASS. Specifically, we develop an automatic fact extraction scheme leveraging OpenIE (Open Information Extraction) and dependency parser tools to extract structured fact tuples. Then, to quantitatively evaluate the factual correctness, we define a factual correctness score function that considers the factual accuracy and factual redundancy. We further propose to adopt reinforcement learning to improve readability and factual correctness by jointly optimizing a mixed-objective learning function. We use the English Gigaword and DUC 2004 datasets to evaluate our model. Experimental results show that compared with competitive models, our model significantly improves the factual correctness and readability of generated summaries, and also reduces duplicates while improving the informativeness.
科研通智能强力驱动
Strongly Powered by AbleSci AI