计算机科学
初始化
短语
人工智能
机器翻译
自然语言处理
判决
光学(聚焦)
任务(项目管理)
简单(哲学)
平行语料库
无监督学习
翻译(生物学)
文本简化
机器学习
程序设计语言
哲学
生物化学
物理
化学
管理
认识论
信使核糖核酸
光学
经济
基因
作者
Jipeng Qiang,Xindong Wu
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2019-10-16
卷期号:33 (4): 1802-1806
被引量:17
标识
DOI:10.1109/tkde.2019.2947679
摘要
Most recent approaches for Text Simplification (TS) have drawn on insights from machine translation to learn simplification rewrites from the monolingual parallel corpus of complex and simple sentences, yet their effectiveness strongly relies on large amounts of parallel sentences. However, there has been a serious problem haunting TS for decades, that is, the availability of parallel TS corpora is scarce or not fit for the learning task. In this paper, we will focus on one especially useful and challenging problem of unsupervised TS without a single parallel sentence. To the best of our knowledge, we present the first unsupervised text simplification system based on phrase-based machine translation system, which leverages a careful initialization of phrase tables and language models. On the widely used WikiLarge and WikiSmall benchmarks, our system respectively obtains 39.08 and 25.12 SARI points, even outperforms some supervised baselines.
科研通智能强力驱动
Strongly Powered by AbleSci AI