Fake news detection: Taxonomy and comparative study

计算机科学分类人工智能分类学（生物学）特征提取机器学习特征（语言学）代表（政治）数据挖掘语言学哲学植物政治政治学法学生物

作者

Faramarz Farhangian,Rafael M. O. Cruz,George D. C. Cavalcanti

出处

期刊：Information Fusion [Elsevier]
日期：2023-11-14 卷期号：103: 102140-102140 被引量：17

标识

DOI：10.1016/j.inffus.2023.102140

摘要

The proliferation of social networks has presented a significant challenge in combating the pervasive issue of fake news within modern societies. Due to the large amount of information and news produced daily in text, audio, and video, the validation and verification of this information have become crucial tasks. Leveraging advancements in artificial intelligence, distinguishing between fake news and factual information through automatic fake news detection systems has become more feasible. Automatic fake news detection has been explored from diverse perspectives, employing various feature extraction and classification models. Nonetheless, empirical evaluations, categorization, and comparisons of existing techniques for handling this problem remain limited. In this paper, we revisit the definitions and perspectives of fake news and propose an updated taxonomy for the field based on multiple criteria: (1) Type of features used in fake news detection; (2) Fake news detection perspectives; (3) Feature representation methods; and (4) Classification approaches. Moreover, we conduct an extensive empirical study to evaluate several feature representation techniques and classification approaches based on accuracy and computational cost. Our experimental results demonstrate that the optimal feature extraction techniques vary depending on the characteristics of the dataset. Notably, context-dependent models based on transformer models consistently exhibit superior performance. Additionally, employing transformer models as feature extraction methods, rather than solely fine-tuning the network for the downstream task, improves overall performance. Through extensive error analysis, we identify that a combination of feature representation methods and classification algorithms, including classical ones, offer complementary aspects and should be considered for achieving better generalization performance while maintaining a relatively low computational cost. For further details, including source codes, figures, and datasets, please refer to our project's GitHub repository: [https://github.com/FFarhangian/Fake-news-detection-Comparative-Study].

求助该文献

最长约 10秒，即可获得该文献文件

Fake news detection: Taxonomy and comparative study

今日热心研友