计算机科学
自然语言处理
依赖关系(UML)
判决
人工智能
语法
错误检测和纠正
变压器
拼写
光学(聚焦)
语法错误
抽象语法
语言学
算法
哲学
物理
量子力学
电压
光学
作者
Wenxin Huang,Xiao Dong,Mengxiang Wang,Guangya Liu,Jianxing Yu,Huaijie Zhu,Jian Yin
标识
DOI:10.1007/978-981-99-8073-4_30
摘要
Existing research primarily focuses on spelling and grammatical errors in English, such as missing or wrongly adding characters. This kind of shallow error has been well-studied. Instead, there are many unsolved deep-level errors in real applications, especially in Chinese, among which semantic errors are one of them. Semantic errors are mainly caused by an inaccurate understanding of the meanings and usage of words. Few studies have investigated these errors. We thus focus on semantic error correction and propose a new dataset, called CSEC, which includes 17,116 sentences and six types of errors. Semantic errors are often found according to the dependency relations of sentences. We thus propose a novel method called Desket (Dependency Syntax Knowledge Enhanced Transformer). Desket solves the CSEC task by (1) capturing the syntax of the sentence, including dependency relations and part-of-speech tagging, and (2) using dependency to guide the generation of the correct output. Experiments on the CSEC dataset demonstrate the superior performance of our model against existing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI