Entity recognition in the field of coal mine construction safety based on a pre-training language model

煤矿开采 计算机科学 条件随机场 变压器 领域(数学) 领域知识 命名实体识别 独创性 人工智能 自然语言处理 数据挖掘 工程类 系统工程 电气工程 数学 电压 法学 政治学 纯数学 任务(项目管理) 废物管理 创造力
作者
Na Xu,Yanxiang Liang,Chaoran Guo,Bo Meng,Xueqing Zhou,Yuting Hu,Bo Zhang
出处
期刊:Engineering, Construction and Architectural Management [Emerald (MCB UP)]
标识
DOI:10.1108/ecam-05-2023-0512
摘要

Purpose Safety management plays an important part in coal mine construction. Due to complex data, the implementation of the construction safety knowledge scattered in standards poses a challenge. This paper aims to develop a knowledge extraction model to automatically and efficiently extract domain knowledge from unstructured texts. Design/methodology/approach Bidirectional encoder representations from transformers (BERT)-bidirectional long short-term memory (BiLSTM)-conditional random field (CRF) method based on a pre-training language model was applied to carry out knowledge entity recognition in the field of coal mine construction safety in this paper. Firstly, 80 safety standards for coal mine construction were collected, sorted out and marked as a descriptive corpus. Then, the BERT pre-training language model was used to obtain dynamic word vectors. Finally, the BiLSTM-CRF model concluded the entity’s optimal tag sequence. Findings Accordingly, 11,933 entities and 2,051 relationships in the standard specifications texts of this paper were identified and a language model suitable for coal mine construction safety management was proposed. The experiments showed that F1 values were all above 60% in nine types of entities such as security management. F1 value of this model was more than 60% for entity extraction. The model identified and extracted entities more accurately than conventional methods. Originality/value This work completed the domain knowledge query and built a Q&A platform via entities and relationships identified by the standard specifications suitable for coal mines. This paper proposed a systematic framework for texts in coal mine construction safety to improve efficiency and accuracy of domain-specific entity extraction. In addition, the pretraining language model was also introduced into the coal mine construction safety to realize dynamic entity recognition, which provides technical support and theoretical reference for the optimization of safety management platforms.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
zzzzzx发布了新的文献求助10
2秒前
刘郑大王发布了新的文献求助30
3秒前
苗啊苗完成签到,获得积分10
4秒前
等等完成签到,获得积分10
5秒前
7秒前
白白不焦虑完成签到,获得积分10
14秒前
18秒前
科研通AI2S应助zzzzzx采纳,获得10
18秒前
21秒前
523完成签到,获得积分10
21秒前
sadascaqwqw发布了新的文献求助10
22秒前
彳亍1117应助舟舟采纳,获得20
23秒前
23秒前
23秒前
合一海盗发布了新的文献求助10
24秒前
26秒前
科研通AI2S应助彼得大帝采纳,获得10
27秒前
liu1发布了新的文献求助10
28秒前
bkagyin应助虚设采纳,获得10
30秒前
30秒前
ZhiyunXu2012完成签到 ,获得积分10
31秒前
万能图书馆应助趙途嘵生采纳,获得10
33秒前
33秒前
34秒前
111发布了新的文献求助10
35秒前
35秒前
38秒前
默默的网络完成签到,获得积分10
40秒前
40秒前
Nara997发布了新的文献求助10
41秒前
42秒前
43秒前
虚设发布了新的文献求助10
44秒前
完美世界应助星星采纳,获得10
46秒前
46秒前
Nara997完成签到,获得积分10
48秒前
48秒前
达到顶峰发布了新的文献求助10
48秒前
共享精神应助小白采纳,获得10
49秒前
高分求助中
LNG地下式貯槽指針(JGA指-107-19)(Recommended practice for LNG inground storage) 1000
Second Language Writing (2nd Edition) by Ken Hyland, 2019 1000
rhetoric, logic and argumentation: a guide to student writers 1000
QMS18Ed2 | process management. 2nd ed 1000
Eric Dunning and the Sociology of Sport 850
Operative Techniques in Pediatric Orthopaedic Surgery 510
A High Efficiency Grating Coupler Based on Hybrid Si-Lithium Niobate on Insulator Platform 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2921394
求助须知:如何正确求助?哪些是违规求助? 2564125
关于积分的说明 6935249
捐赠科研通 2221649
什么是DOI,文献DOI怎么找? 1180926
版权声明 588787
科研通“疑难数据库(出版商)”最低求助积分说明 577770