Abstractive text summarization using deep learning with a new Turkish summarization benchmark dataset

自动汇总 计算机科学 标题 水准点(测量) 胭脂 情报检索 人工智能 土耳其 自然语言处理 深度学习 判决 多文档摘要 排名(信息检索) 语言学 地理 哲学 大地测量学
作者
Fatih Ertam,Galip Aydın
出处
期刊:Concurrency and Computation: Practice and Experience [Wiley]
卷期号:34 (9) 被引量:10
标识
DOI:10.1002/cpe.6482
摘要

Abstract Exponential increase in the amount of textual data made available on the Internet results in new challenges in terms of accessing information accurately and quickly. Text summarization can be defined as reducing the dimensions of the expressions to be summarized without spoiling the meaning. Summarization can be performed as extractive and abstractive or using both together. In this study, we focus on abstractive summarization which can produce more human‐like summarization results. For the study we created a Turkish news summarization benchmark dataset from various news agency web portals by crawling the news title, short news, news content, and keywords for the last 5 years. The dataset is made publicly available for researchers. The deep learning network training was carried out by using the news headlines and short news contents from the prepared dataset and then the network was expected to create the news headline as the short news summary. To evaluate the performance of this study, Rouge‐1, Rouge‐2, and Rouge‐L were compared using precision, sensitivity and F1 measure scores. Performance values for the study were presented for each sentence as well as by averaging the results for 50 randomly selected sentences. The F1 Measure values are 0.4317, 0.2194, and 0.4334 for Rouge‐1, Rouge‐2, and Rouge‐L respectively. Performance results show that the approach is promising for Turkish text summarization studies and the prepared dataset will add value to the literature.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
xixixii发布了新的文献求助10
1秒前
RimutO0530发布了新的文献求助10
1秒前
玉七发布了新的文献求助10
1秒前
思源应助小橘子采纳,获得10
1秒前
易楠发布了新的文献求助10
2秒前
2秒前
木易完成签到,获得积分10
3秒前
3秒前
sweet发布了新的文献求助10
4秒前
笨笨如之发布了新的文献求助10
4秒前
13728891737发布了新的文献求助30
4秒前
4秒前
木子不甜完成签到,获得积分10
4秒前
英姑应助不喜采纳,获得10
5秒前
5秒前
71333197发布了新的文献求助10
5秒前
6秒前
7秒前
jjj完成签到,获得积分10
7秒前
选民很头疼完成签到,获得积分10
8秒前
hitagi发布了新的文献求助10
8秒前
端庄梦松发布了新的文献求助10
8秒前
9秒前
9秒前
酷波er应助易楠采纳,获得10
9秒前
要苦就苦别人完成签到,获得积分10
9秒前
9秒前
jqian发布了新的文献求助30
10秒前
10秒前
jianguo发布了新的文献求助10
10秒前
wa完成签到,获得积分20
10秒前
大腚疯猪应助老实的文龙采纳,获得20
10秒前
科研通AI6.2应助TenFire采纳,获得30
11秒前
落京关注了科研通微信公众号
11秒前
傻傻的怡完成签到 ,获得积分10
11秒前
高欢完成签到,获得积分10
11秒前
11秒前
852应助Tao采纳,获得10
11秒前
可靠皮皮虾完成签到,获得积分20
11秒前
高分求助中
卤化钙钛矿人工突触的研究 1000
Engineering for calcareous sediments : proceedings of the International Conference on Calcareous Sediments, Perth 15-18 March 1988 / edited by R.J. Jewell, D.C. Andrews 1000
Wolffs Headache and Other Head Pain 9th Edition 1000
Continuing Syntax 1000
Signals, Systems, and Signal Processing 510
Cardiac structure and function of elite volleyball players across different playing positions 500
CLSI H26-A2 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6241448
求助须知:如何正确求助?哪些是违规求助? 8065476
关于积分的说明 16833419
捐赠科研通 5319735
什么是DOI,文献DOI怎么找? 2832817
邀请新用户注册赠送积分活动 1810224
关于科研通互助平台的介绍 1666760