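Computer science
Modular design
Graph
Artificial intelligence
Data mining
Machine learning
Set (abstract data type)
Node (physics)
Exploit
Baseline (sea)
Pattern recognition (psychology)
Theoretical computer science
Oceanography
Computer security
Structural engineering
Engineering
Programming language
Geology
Operating system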
Authors
Anjan Chowdhury, Sriram Srinivasan, Animesh Mukherjee, Sanjukta Bhowmick, Kuntal Ghosh
Source
Journal: ACM Transactions on Knowledge Discovery From Data
[Association for Computing Machinery]
Date: 2023-09-06
Volume (Issue): 18 (1): 1-31
Abstract
Graph Neural Networks (GNNs) are a popular machine learning framework for solving various graph processing applications. This framework exploits both the graph topology and the feature vectors of the nodes. One important application of GNNs is the semi-supervised node classification task. The accuracy of node classification using a GNN depends on (i) the number and (ii) the choice of the training nodes. In this article, we demonstrate that increasing the number of training nodes by selecting nodes from the same class that are spread out across non-contiguous subgraphs can significantly improve the accuracy. We accomplish this by presenting a novel input intervention technique that can be used in conjunction with different GNN classification methods to increase the number of non-contiguous training nodes and, thereby, improve the accuracy. We also present an output intervention technique to identify misclassified nodes and relabel them with their potentially correct labels. We demonstrate on real-world networks that our proposed methods, both individually and collectively, significantly improve the accuracy in comparison to the baseline GNN algorithms. Both our methods are agnostic. Apart from the initial set of training nodes generated by the baseline GNN methods, our techniques do not need any other extra knowledge about the classes of the nodes. Thus, our methods are modular and can be used as pre- and post-processing steps with many of the currently available GNN methods to improve their accuracy.
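To make the phrase "same class, spread out across non-contiguous subgraphs" concrete, the following is a minimal sketch and not the authors' input intervention technique, which the abstract does not specify. It assumes a hypothetical helper `pick_non_contiguous`, an adjacency dictionary `adj`, and class labels for a pool of candidate nodes (in practice these would have to come from whatever class information is available, such as the initial training set). It greedily keeps a candidate only if it is not adjacent to an already selected node of the same class.

```python
from collections import defaultdict


def pick_non_contiguous(adj, labels, candidates, per_class):
    """Hypothetical helper (illustration only, not the paper's algorithm).

    Greedily pick up to `per_class` candidates per class so that no two
    picked nodes of the same class are direct neighbours in the graph.
    """
    picked = defaultdict(list)
    for node in candidates:
        cls = labels[node]
        chosen = picked[cls]
        if len(chosen) >= per_class:
            continue  # this class already has enough training nodes
        if any(other in adj[node] for other in chosen):
            continue  # adjacent to a same-class pick, so it is contiguous
        chosen.append(node)
    return dict(picked)


# Toy graph (assumed data): a path 0-1-2-3 plus a separate edge 4-5.
adj = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}, 4: {5}, 5: {4}}
labels = {0: "a", 1: "a", 2: "a", 3: "b", 4: "a", 5: "b"}

selected = pick_non_contiguous(adj, labels, candidates=[0, 1, 2, 3, 4, 5], per_class=3)
print(selected)
```

On this toy graph, node 1 is skipped because it neighbours node 0 of the same class, so the class "a" training nodes end up spread over separate regions of the graph rather than clustered together.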