Improving Deep Learning Based Password Guessing Models Using Pre-processing

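Computer science; Artificial intelligence; Password; Deep learning; Dictionary attack; Preprocessor; Artificial neural network; Autoencoder; Machine learning; Computer network
Authors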
Yuxuan Wu,Ding Wang,Yunkai Zou,Ziyi Huang
Source
Journal: Lecture Notes in Computer Science, pp. 163-183
Identifier
DOI:10.1007/978-3-031-15777-6_10
Abstract

Passwords are the most widely used authentication method and play an important role in users' digital lives. Password guessing models are generally used to understand password security, yet statistical password models (like the Markov model and probabilistic context-free grammars (PCFG)) are subject to the inherent limitations of overfitting and sparsity. With the improvement of computing power, deep-learning based models with higher cracking rates are emerging. Since neural networks are generally used as black boxes for learning password features, a key challenge for deep-learning based password guessing models is choosing appropriate preprocessing methods so that more effective features are learned. To fill this gap, this paper explores three new preprocessing methods and attempts to apply them to two promising deep-learning networks, i.e., Long Short-Term Memory (LSTM) neural networks and Generative Adversarial Networks (GAN). First, we propose a character-feature based encoding method to replace the canonical one-hot encoding. Second, we add the most comprehensive recognition rules to date for words, keyboard patterns, years, and website names to the basic PCFG, and find that the frequency distribution of the extracted segments follows Zipf's law. Third, we adopt Xu et al.'s PCFG improvement with chunk segmentation (CCS'21), and study the performance of the Chunk+PCFG preprocessing method when applied to LSTM and GAN. Extensive experiments on six large real-world password datasets show the effectiveness of our preprocessing methods. Results show that within 50 million guesses: 1) when we apply the PCFG preprocessing method to PassGAN (a GAN-based password model proposed by Hitaj et al. at ACNS'19), 13.83%–38.81% (26.79% on average) more passwords can be cracked; 2) our LSTM based model using PCFG for preprocessing (PL for short) outperforms Wang et al.'s original PL model by 0.35%–3.94% (1.36% on average). Overall, our preprocessing methods improve the attack success rate in four out of seven tested cases. We believe this work provides new feasible directions for guessing optimization, and contributes to a better understanding of deep-learning based models.
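
To make the idea of PCFG-style preprocessing concrete, the following is a minimal sketch of the basic letter/digit/symbol segmentation that PCFG password models commonly start from. It is an illustration only, not the authors' implementation: the function names `char_class`, `pcfg_segments`, and `base_structure` are hypothetical, and the paper's additional recognition rules (words, keyboard patterns, years, website names) and the Chunk+PCFG variant are not reproduced here.

```python
from itertools import groupby


def char_class(ch: str) -> str:
    """Map a character to a basic PCFG class: L (letter), D (digit), S (other)."""
    if ch.isalpha():
        return "L"
    if ch.isdigit():
        return "D"
    return "S"


def pcfg_segments(password: str) -> list:
    """Split a password into maximal runs of one class, tagged like ('L8', 'password')."""
    segments = []
    for cls, group in groupby(password, key=char_class):
        text = "".join(group)
        segments.append((f"{cls}{len(text)}", text))
    return segments


def base_structure(password: str) -> str:
    """Concatenate the segment tags into a base structure string such as 'L8D3S1'."""
    return "".join(tag for tag, _ in pcfg_segments(password))


if __name__ == "__main__":
    print(pcfg_segments("password123!"))
    print(base_structure("password123!"))
```

Under this assumed scheme, a model would be trained on sequences of tagged segments rather than on raw characters, which is the general sense in which segmentation acts as a preprocessing step.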
