支持向量机
人工智能
计算机科学
卷积神经网络
二元曲线
骗局
机器学习
特征提取
tf–国际设计公司
相似性(几何)
特征(语言学)
余弦相似度
模式识别(心理学)
三元曲线
期限(时间)
物理
替代医学
语言学
病理
哲学
量子力学
图像(数学)
医学
作者
Manuel Benedict,Erwin Budi Setiawan
标识
DOI:10.1109/icoict58202.2023.10262433
摘要
Hoax news has long been a problem for society that is quite worrying because receiving hoax news can change a person's point of view to something that is not good, the impact of which is detrimental to many individuals and groups of people. Machine learning and deep learning can be implemented to detect hoax news. Examples of methods used in previous studies are SVM (Support Vector Machine) and CNN (Convolutional Neural Network). This research proposes the application of the CNN and SVM methods. In addition, this research develops a CNN-SVM hybrid model, which is the uniqueness of this research. The dataset is sourced from Twitter which focuses on the Ferdy Sambo Case and the Kanjuruhan Tragedy that will occur in 2022. The dataset amounts to 25,325 and is divided into two with a splitting ratio of 90:10. After three algorithms was trained, they achieved excellent performance. This matter can be seen from the accuracy scores for the two methods, which managed to improve their performance after feature extraction and expansion were applied with TF-IDF (Term Frequency Inverse Document Frequency) feature extraction, unigram + bigram weighting, and feature expansion with GloVe (Global Vector for Word Representation). The highest performance model is the SVM model with the similarity top 1 and Tweet corpus (95.95% accuracy), followed by the hybrid CNN-SVM model with the similarity top 10 and Tweet + News corpus (95.79% accuracy) and CNN model with the similarity top 15 with Tweet + News corpus (95.11% accuracy).
科研通智能强力驱动
Strongly Powered by AbleSci AI