计算机科学
命名实体识别
Softmax函数
领域(数学分析)
自然语言处理
人工智能
功率(物理)
构造(python库)
情报检索
深度学习
任务(项目管理)
工程类
数学分析
程序设计语言
物理
数学
系统工程
量子力学
作者
Jun Feng,Hongkai Wang,Liangying Peng,Yidan Wang,Haomin Song,Guo Hong-ju
出处
期刊:Communications in computer and information science
日期:2024-01-01
卷期号:: 133-146
标识
DOI:10.1007/978-981-99-9614-8_9
摘要
The field of electrical power encompasses a vast array of diverse information modalities, with textual data standing as a pivotal constituent of this domain. In this study, we harness an extensive corpus of textual data drawn from the electrical power systems domain, comprising regulations, reports, and other pertinent materials. Leveraging this corpus, we construct an Electrical Power Systems Corpus and proceed to annotate entities within this text, thereby introducing a novel Named Entity Recognition (NER) dataset tailored specifically for the electrical power domain. We employ an end-to-end deep learning model, the BERT-BiLSTM-CRF model, for named entity recognition on our custom electrical power domain dataset. This NER model integrates the BERT pre-trained model into the traditional BiLSTM-CRF model, enhancing its ability to capture contextual and semantic information within the text. Results demonstrate that the proposed model outperforms both the BiLSTM-CRF model and the BERT-softmax model in NER tasks across the electrical power domain and various other domains. This study contributes to the advancement of NER applications in the electrical power domain and holds significance for furthering the construction of knowledge graphs and databases related to electrical power systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI