命名实体识别
计算机科学
术语
领域(数学)
词(群论)
依赖关系(UML)
自然语言处理
人工智能
情报检索
语言学
数学
工程类
哲学
系统工程
纯数学
任务(项目管理)
作者
Yu Lan,Hongguang Xu,Ke Xu
标识
DOI:10.1109/asid50160.2020.9271708
摘要
Most of the researches on Chinese named entity recognition (NER) focus on the general field, and few on NER in the field of science and technology. On one hand, technical terms in the field of science and technology appear in general texts less frequently, with most of which being compound words, and the performance of word segmentation processing on texts in the field of science and technology is poor. On the other hand, texts in the field of science and technology are more accurate and standardized than those in general fields. By analyzing these characteristics of texts in the field of science and technology, this paper attempts to train word vectors by constructing terminology dictionaries and introducing dependency analysis. Referring to the latest NER research results in the current Chinese general field, i.e. the method of merging character vectors with word vectors, we will perform NER on texts in the field of science and technology. Through experiments, it is proved that the proposed method is more effective comparing to existing works. In addition, the effect of introducing attention mechanism on NER results is also studied.
科研通智能强力驱动
Strongly Powered by AbleSci AI