计算机科学
异常检测
人工智能
特征(语言学)
模式识别(心理学)
代表(政治)
适应(眼睛)
构造(python库)
推论
特征学习
一致性(知识库)
情态动词
任务(项目管理)
机器学习
工程类
哲学
语言学
物理
化学
光学
系统工程
政治
政治学
高分子化学
法学
程序设计语言
作者
Yuanpeng Tu,Boshen Zhang,Liang Liu,Yuxi Li,Chenhai Xu,Jiangning Zhang,Yabiao Wang,Chengjie Wang,Cai Rong Zhao
出处
期刊:Cornell University - arXiv
日期:2024-01-01
标识
DOI:10.48550/arxiv.2401.03145
摘要
Industrial anomaly detection is generally addressed as an unsupervised task that aims at locating defects with only normal training samples. Recently, numerous 2D anomaly detection methods have been proposed and have achieved promising results, however, using only the 2D RGB data as input is not sufficient to identify imperceptible geometric surface anomalies. Hence, in this work, we focus on multi-modal anomaly detection. Specifically, we investigate early multi-modal approaches that attempted to utilize models pre-trained on large-scale visual datasets, i.e., ImageNet, to construct feature databases. And we empirically find that directly using these pre-trained models is not optimal, it can either fail to detect subtle defects or mistake abnormal features as normal ones. This may be attributed to the domain gap between target industrial data and source data.Towards this problem, we propose a Local-to-global Self-supervised Feature Adaptation (LSFA) method to finetune the adaptors and learn task-oriented representation toward anomaly detection.Both intra-modal adaptation and cross-modal alignment are optimized from a local-to-global perspective in LSFA to ensure the representation quality and consistency in the inference stage.Extensive experiments demonstrate that our method not only brings a significant performance boost to feature embedding based approaches, but also outperforms previous State-of-The-Art (SoTA) methods prominently on both MVTec-3D AD and Eyecandies datasets, e.g., LSFA achieves 97.1% I-AUROC on MVTec-3D, surpass previous SoTA by +3.4%.
科研通智能强力驱动
Strongly Powered by AbleSci AI