计算机科学
微博
随机森林
超参数
社会化媒体
人气
数据挖掘
滤波器(信号处理)
机器学习
人工智能
万维网
心理学
计算机视觉
社会心理学
作者
Sepideh Bazzaz Abkenar,Ebrahim Mahdipour,Seyed Mahdi Jameii,Mostafa Haghi Kashani
摘要
Summary Social networking services are online platforms that are distributed across different computers over long distances. Twitter is the most popular microblogging site that allows users to share their opinions and real‐world events. Due to its popularity and ease of use, Twitter has also attracted spammers. As a result, spam detection is one of the most critical problems. In order to provide a spam‐free environment, it is necessary to identify and filter spam tweets as well as their owners. A hybrid method, which is based on Synthetic Minority Over‐sampling TEchnique (SMOTE) and Differential Evolution (DE) strategies, is presented to enhance the spam detection rate in real Twitter datasets. SMOTE is applied to tackle the imbalanced class distribution of datasets, while DE is used to tune Random Forest (RF) hyperparameters. Compared with related work and based on evaluation results, the presented method significantly enhances the classification performance in imbalanced datasets. The detection rate of optimized RF with excellent F 1 ‐score and Area Under the Receiver Operating Characteristic Curve (AUROC), which are 98.97% and 0.999, respectively, demonstrates the high efficiency of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI