DOMR: Toward Deep Open-World Malware Recognition

计算机科学恶意软件人工智能遗忘机器学习再培训 Android（操作系统）推论深度学习代表（政治）计算机安全哲学法学国际贸易业务操作系统政治语言学政治学

作者

Tingting Lu,Junfeng Wang

出处

期刊：IEEE Transactions on Information Forensics and Security [Institute of Electrical and Electronics Engineers]
日期：2023-11-30 卷期号：19: 1455-1468 被引量：8

标识

DOI：10.1109/tifs.2023.3338469

摘要

Deep learning has been widely used for Android malware family recognition, but current deep learning-based approaches make the closed-world assumption that malware families encountered during testing are available at training phase. Unfortunately, this assumption is often violated in practice due to the constant emergence of novel categories and the huge cost of collecting abundant training classes, causing serious failures to the existing approaches. Accordingly, a new problem setting for Android malware family recognition is introduced, i.e., deep open-world malware recognition that poses two critical tasks: 1) Open recognition, aiming to not only classify malware from known families (present in training) but detect malware from unknown families (absent in training); 2) Incremental update, aiming to learn about the detected unknown/new categories without retraining from scratch and catastrophically forgetting the previously learned known/old classes. This paper formalizes the problem and proposes a novel solution called DOMR to address the above two tasks in a unified framework. The core of DOMR is an episode-based representation learning scheme that mimics the open-world setting through episodic training to learn a generalizable representation. The key insight is that the training process following the open-world setting forces the representation to accumulate experience in open recognition, thereby facilitating both the classification of known family instances and the detection of unknown family instances at inference. Given this representation, multiple one-vs-rest classifiers are subsequently built to make the final recognition decision through an aggregative strategy. Comparative experiments show that DOMR outperforms start-of-the-art methods, with macro-averaged F1-scores obtained on two datasets reaching 80.88% and 56.17% in the open case, and 79.34% and 49.55% in the incremental case, respectively. Ablation studies further analyze the effectiveness of DOMR in achieving the open recognition and incremental update goals.

求助该文献

最长约 10秒，即可获得该文献文件

DOMR: Toward Deep Open-World Malware Recognition

今日热心研友