判别式
假新闻
计算机科学
集合(抽象数据类型)
图形
二元分类
可靠性(半导体)
二进制数
机器学习
基线(sea)
钥匙(锁)
标记数据
人工智能
模式识别(心理学)
互联网隐私
支持向量机
数学
理论计算机科学
功率(物理)
海洋学
物理
算术
计算机安全
量子力学
程序设计语言
地质学
作者
Mariana Caravanti de Souza,Marcos Paulo Silva Gôlo,Alípio Mário Guedes Jorge,Evelin Amorim,Ricardo Campos,Ricardo Marcondes Marcacini,Solange Oliveira Rezende
标识
DOI:10.1016/j.ins.2024.120300
摘要
Fake news detection (FND) tools are essential to increase the reliability of information in social media. FND can be approached as a machine learning classification problem so that discriminative features can be automatically extracted. However, this requires a large news set, which in turn implies a considerable amount of human experts' effort for labeling. In this paper, we explore Positive and Unlabeled Learning (PUL) to reduce the labeling cost. In particular, we improve PUL with the network-based Label Propagation (PU-LP) algorithm. PU-LP achieved competitive results in FND exploiting relations between news and terms and using few labeled fake news. We propose integrating an attention mechanism in PU-LP that can define which terms in the network are more relevant for detecting fake news. We use GNEE, a state-of-the-art algorithm based on graph attention networks. Our proposal outperforms state-of-the-art methods, improving F1 in 2% to 10%, especially when only 10% labeled fake news are available. It is competitive with the binary baseline, even when nearly half of the data is labeled. Discrimination ability is also visualized through t-SNE. We also present an analysis of the limitations of our approach according to the type of text found in each dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI