随机森林
计算机科学
初始化
网络钓鱼
机器学习
人口
特征选择
精确性和召回率
数据挖掘
人工智能
互联网
万维网
社会学
人口学
程序设计语言
作者
Erzhou Zhu,Zhile Chen,Jie Cui,Hong Zhong
出处
期刊:IEEE Transactions on Network and Service Management
[Institute of Electrical and Electronics Engineers]
日期:2022-03-29
卷期号:19 (4): 4461-4478
被引量:14
标识
DOI:10.1109/tnsm.2022.3162885
摘要
To effectively boost computer usage, machine learning models are used in several phishing detection systems to classify enormous phishing datasets. Based on phishing patterns, researchers prefer to extract a considerable number of features to improve phishing detection performance. However, redundant and useless features in the feature set degrade the performance of the underlying classification models. In addition, several existing phishing detection models mainly focus on detection accuracy and overlook recall rates. However, in phishing detection, it is more harmful to falsely detect a phishing website as a legitimate website than it is to detect a legitimate website as a phishing website. This study proposes a novel phishing detection model, multi-objective evolution/random forest (MOE/RF), which is based on the revised multi-objective evolution optimization algorithm (MOE) and random forest (RF). The MOE/RF model uses accuracy as the detection target and minimizes the probability of false detection of phishing sites. In addition, two new strategies, the symmetric uncertainty-based population initialization and the population state-based adaptive environmental selection, are proposed to improve the performance of the MOE. Experimental results on testing five different phishing datasets demonstrated that the MOE/RF performs superior to several existing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI