计算机科学
人工神经网络
分类器(UML)
因果模型
一般化
图形
人工智能
数据挖掘
理论计算机科学
机器学习
数学
统计
数学分析
作者
Hao Wang,Ruonan Liu,Steven X. Ding,Qinghua Hu,Zengxiang Li,Hongkuan Zhou
出处
期刊:IEEE Transactions on Industrial Informatics
[Institute of Electrical and Electronics Engineers]
日期:2024-02-01
卷期号:20 (2): 1987-1996
被引量:1
标识
DOI:10.1109/tii.2023.3282979
摘要
In modern industrial systems, components have complex interactions with each other, which makes it become a challenging task to identify the operational conditions of industrial systems. Considering that an industrial system, the embedded components and their interactions can be expressed as nodes and edges in a graph, respectively. Therefore, graph representation algorithms are powerful tools for fault diagnosis of industrial systems. As one of the most commonly used graph representation algorithms, Graph Neural Networks (GNN) mainly follow the law of “learning to attend”. GNN extract training data features, learn the statistical correlations between features and labels, resulting in the attended graph favoring for accessing non-causal features as a shortcut for prediction. This shortcut feature is unstable and depends on the data distribution characteristics in the training dataset, which reduces the generalization ability of the classifier. By performing the causal analysis of GNN modeling for graph representation, the results show that shortcut features act as confounding factors between causal features and predictions, causing classifiers to learn wrong correlations. Therefore, to discover patterns of causality and weaken the confounding effects of shortcut features, a Causal-Trivial Attention Graph Neural Network (CTA-GNN) strategy is proposed. Firstly, node and edge representations are given by estimating soft masks. Secondly, through disentanglement, both causal features and shortcut features are obtained from the graph. Thirdly, the backdoor adjustment of the causal theory is parameterized to combine each causal feature with a variety of shortcut features. Finally, comparative experiments on the Three-Phase Flow Facility (TFF) dataset illustrate the effectiveness of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI