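Computer science
Payload (computing)
Encryption
Traffic classification
Embedding
Feature extraction
Artificial intelligence
Data mining
Byte
Machine learning
Computer network
Network packet
Computer hardware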
Authors
Hong Ye He, Zhi Guo Yang, Xiang Ning Chen
Identifier
DOI:10.23919/ituk50268.2020.9303204
Abstract
Traffic identification is becoming more important, yet more challenging, as encryption techniques develop rapidly. In contrast to recent deep learning methods that apply image processing to encrypted traffic problems, in this paper we propose a method named Payload Encoding Representation from Transformer (PERT), which performs automatic traffic feature extraction using a state-of-the-art dynamic word embedding technique. Building on this, we provide a traffic classification framework in which unlabeled traffic is used to pre-train an encoding network that learns the contextual distribution of traffic payload bytes. The downstream classification then reuses the pre-trained network to obtain an enhanced classification result. Through experiments on a public encrypted traffic data set and on our own captured Android HTTPS traffic, we show that the proposed method is clearly more effective than the compared baselines. To the best of our knowledge, this is the first time that encrypted traffic classification using dynamic word embedding, along with its pre-training strategy, has been addressed.
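The pre-training idea in the abstract (learning the contextual distribution of payload bytes from unlabeled traffic) can be illustrated with a minimal data-preparation sketch. This is not the authors' implementation: the abstract does not specify the tokenization or the pre-training objective, so the one-token-per-byte hex encoding, the masked-token objective, the `[MASK]` symbol, the 15% masking rate, and the helper names `payload_to_tokens` and `mask_tokens` are all assumptions made for illustration.

```python
import random

MASK = "[MASK]"  # hypothetical mask symbol, not taken from the paper


def payload_to_tokens(payload):
    """Map each payload byte to a two-character hex token (assumed encoding)."""
    return [f"{b:02x}" for b in payload]


def mask_tokens(tokens, mask_prob=0.15, seed=0):
    """Hide a random subset of tokens for a masked-token objective (assumed).

    Returns (masked_tokens, labels). labels[i] holds the original token
    where position i was masked, and None everywhere else.
    """
    rng = random.Random(seed)  # seeded so the example is reproducible
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            labels.append(tok)
        else:
            masked.append(tok)
            labels.append(None)
    return masked, labels


# Example: the first bytes of a made-up payload, no labels required.
payload = bytes.fromhex("1603010200010001fc0303")
tokens = payload_to_tokens(payload)
masked, labels = mask_tokens(tokens)
```

An encoder trained to recover the hidden tokens from their neighbors needs no class labels, which is what allows unlabeled traffic to be used before the classifier is fine-tuned on the smaller labeled set.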