计算机科学
对抗制
依赖关系(UML)
可靠性(半导体)
数据挖掘
数据中心
领域(数学分析)
特征(语言学)
钥匙(锁)
域适应
断层(地质)
数据建模
分布式计算
机器学习
人工智能
计算机网络
物理
语言学
地质学
数学分析
哲学
计算机安全
分类器(UML)
功率(物理)
地震学
数据库
数学
量子力学
作者
Xiang Lan,Dianwen Ng,Yi Liu,Jiongzhou Liu,Fan Xu,Cheng He,Mengling Feng
标识
DOI:10.1109/ijcnn52387.2021.9533383
摘要
Disk fault is known to be the key cause of data loss in the modern large-scale data center, which affects the reliability and stability of the server and even the whole IT infrastructure, resulting in high financial cost. Recent works on disk fault prediction demonstrate the ability of machine learning techniques in the early prediction of disk failure. However, two limitations hinder the real-world application of current methods. First, they ignore the data heterogeneity in the data center, where distribution shifts commonly exist across different disk model types that decrease the model's performance. Second, the number of disks of different disk model types varies greatly in the data center, and current methods failed to deliver an acceptable performance over disk model types with few training samples. To address these limitations, we propose adversarial domain adaptation with correlation-based association networks (ADA-CBAN) to both mitigate the distribution shift problem and also to boost the performance on small-scaled data. Extensive experiments prove that our proposed model is effective and achieves new state-of-the-art. In addition, our post model analysis can also reveal important feature interactions and highlight the crucial period before disk faults, where both are useful information in real-world applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI