计算机科学
稳健性(进化)
数据挖掘
自然语言
风险评估
自然语言处理
人工智能
数据科学
风险分析(工程)
机器学习
计算机安全
生物化学
医学
基因
化学
作者
Mohammad Zaid Kamil,Mohammed Taleb‐Berrouane,Faisal Khan,Paul Amyotte,Salim Ahmed
出处
期刊:Risk Analysis
[Wiley]
日期:2023-01-22
卷期号:43 (10): 2033-2052
被引量:4
摘要
Abstract Underlying information about failure, including observations made in free text, can be a good source for understanding, analyzing, and extracting meaningful information for determining causation. The unstructured nature of natural language expression demands advanced methodology to identify its underlying features. There is no available solution to utilize unstructured data for risk assessment purposes. Due to the scarcity of relevant data, textual data can be a vital learning source for developing a risk assessment methodology. This work addresses the knowledge gap in extracting relevant features from textual data to develop cause–effect scenarios with minimal manual interpretation. This study applies natural language processing and text‐mining techniques to extract features from past accident reports. The extracted features are transformed into parametric form with the help of fuzzy set theory and utilized in Bayesian networks as prior probabilities for risk assessment. An application of the proposed methodology is shown in microbiologically influenced corrosion‐related incident reports available from the Pipeline and Hazardous Material Safety Administration database. In addition, the trained named entity recognition (NER) model is verified on eight incidents, showing a promising preliminary result for identifying all relevant features from textual data and demonstrating the robustness and applicability of the NER method. The proposed methodology can be used in domain‐specific risk assessment to analyze, predict, and prevent future mishaps, ameliorating overall process safety.
科研通智能强力驱动
Strongly Powered by AbleSci AI