已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

HDP-CNN: Highway deep pyramid convolution neural network combining word-level and character-level representations for phishing website detection

计算机科学 卷积神经网络 文字嵌入 网络钓鱼 深度学习 嵌入 特征(语言学) 人工智能 棱锥(几何) 性格(数学) 词(群论) 特征学习 欺骗攻击 代表(政治) 特征工程 支持向量机 互联网 万维网 计算机安全 物理 哲学 光学 政治 法学 语言学 数学 政治学 几何学
作者
Faan Zheng,Qiao Yan,Victor C. M. Leung,F. Richard Yu,Zhong Ming
出处
期刊:Computers & Security [Elsevier BV]
卷期号:114: 102584-102584 被引量:18
标识
DOI:10.1016/j.cose.2021.102584
摘要

Phishing has become a prevailing method for attackers to steal users' private data and commit fraud, posing a serious threat to Internet users. How to detect phishing websites has attracted great interests from both academia and industry. A popular approach is to use support vector machine (SVM) to detect phishing websites. However, this approach relies on extracting features designated by experts, and the prediction effectiveness of the model is greatly affected by the quality of feature extraction. In addition, it cannot handle features that are not identifiable. Deep learning methods therefore become popular as they do not require manual feature engineering. However, many deep learning methods can only learn feature information of uniform resource locators (URLs) at the character level, while ignoring the intrinsic connections of words. To address these limitations, we propose a novel highway deep pyramid convolution neural network (HDP-CNN), a deep convolutional network that combines character-level and word-level representation information. HDP-CNN first receives the URL string sequences as input, then performs character-level embedding and word-level embedding respectively. Afterward, it uses the Highway network to connect the character-level embedding representation and word-level embedding representation of the URL and extracts local features of different sizes from the region embedding layer. Finally, it passes them into the designed deep pyramid structure network to capture the global representation of the URL. Our experiments illustrate that the information expressed by embedding vectors of different granularities has subtle differences. By combining embedding feature information of different granularities, HDP-CNN exhibits better performance than methods based on single embedding feature information. In our experiments, we construct an imbalanced dataset that has the ratio of benign websites to phishing websites is close to 5:1. The experimental results demonstrate that our method outperforms other methods, with accuracy at 98.30%, true positive rate (TPR) at 99.18%, and true negative rate (TNR) at 94.34%.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
通通发布了新的文献求助10
刚刚
1122846发布了新的文献求助10
1秒前
科研通AI6.3应助娜娜采纳,获得10
2秒前
木呆完成签到,获得积分10
3秒前
3秒前
4秒前
大包鸡完成签到 ,获得积分10
6秒前
只只发布了新的文献求助10
8秒前
木呆发布了新的文献求助10
9秒前
惜海完成签到,获得积分20
9秒前
11秒前
彭于晏应助自信语雪采纳,获得10
12秒前
lunar完成签到 ,获得积分10
13秒前
奋斗蚂蚁完成签到 ,获得积分10
14秒前
orixero应助Lily采纳,获得10
15秒前
SciGPT应助惜海采纳,获得10
15秒前
科研通AI6.2应助高帮白袜采纳,获得10
17秒前
科研通AI6.4应助娜娜采纳,获得10
21秒前
22秒前
芋头喵喵完成签到,获得积分10
26秒前
vkey完成签到,获得积分10
28秒前
28秒前
Hhh完成签到 ,获得积分10
32秒前
33秒前
69完成签到,获得积分10
34秒前
35秒前
桃花债发布了新的文献求助20
38秒前
39秒前
SciGPT应助陈也许采纳,获得20
39秒前
传奇3应助健忘语风采纳,获得10
39秒前
科研通AI2S应助娜娜采纳,获得10
42秒前
Finch完成签到,获得积分10
49秒前
倒霉的芒果完成签到 ,获得积分10
49秒前
54秒前
小张完成签到 ,获得积分10
56秒前
隐形曼青应助1122846采纳,获得10
57秒前
BX发布了新的文献求助10
58秒前
1分钟前
1分钟前
Lily发布了新的文献求助10
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Kinesiophobia : a new view of chronic pain behavior 2000
Cytological studies on Phanerogams in Southern Peru. I. Karyotype of Acaena ovalifolia 2000
Cronologia da história de Macau 1600
Earth System Geophysics 1000
Bioseparations Science and Engineering Third Edition 1000
Lloyd's Register of Shipping's Approach to the Control of Incidents of Brittle Fracture in Ship Structures 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 纳米技术 有机化学 物理 生物化学 化学工程 计算机科学 复合材料 内科学 催化作用 光电子学 物理化学 电极 冶金 遗传学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 6123984
求助须知:如何正确求助?哪些是违规求助? 7951696
关于积分的说明 16498245
捐赠科研通 5244702
什么是DOI,文献DOI怎么找? 2801522
邀请新用户注册赠送积分活动 1782881
关于科研通互助平台的介绍 1654133