计算机科学
命名实体识别
自然语言处理
人工智能
条件随机场
CRF公司
文字嵌入
背景(考古学)
任务(项目管理)
构造(python库)
实体链接
汉字
嵌入
依赖关系(UML)
领域(数学)
知识库
纯数学
程序设计语言
管理
经济
古生物学
生物
数学
作者
Yu Wang,Bin Xia,Zheng Liu,Yun Li,Tao Li
标识
DOI:10.1109/iske.2017.8258773
摘要
Named Entity Recognition (NER) is a basic task in Natural Language Processing (NLP), which extracts the meaningful named entities from the text. Compared with the English NER, the Chinese NER is more challenge, since there is no tense in the Chinese language. Moreover, the omissions and the Internet catchwords in the Chinese corpus make the NER task more difficult. Traditional machine learning methods (e.g., CRFs) cannot address the Chinese NER effectively because they are hard to learn the complicated context in the Chinese language. To overcome the aforementioned problem, we propose a deep learning model Char2Vec+Bi-LSTMs for Chinese NER. We use the Chinese character instead of the Chinese word as the embedding unit, and the Bi-LSTMs is used to learn the complicated semantic dependency. To evaluate our proposed model, we construct the corpus from the China TELECOM FAQs. Experimental results show that our model achieves better performance than other baseline methods and the character embedding is more appropriate than the word embedding in the Chinese language.
科研通智能强力驱动
Strongly Powered by AbleSci AI