Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping

46 citations
Ranked #610 of 2716 papers in CVPR 2024

Abstract

The paper explores the industrial multimodal Anomaly Detection (AD) task, which exploits point clouds and RGB images to localize anomalies. We introduce a novel light and fast framework that learns to map features from one modality to the other on nominal samples. At test time, anomalies are detected by pinpointing inconsistencies between observed and mapped features. Extensive experiments show that our approach achieves state-of-the-art detection and segmentation performance in both the standard and few-shot settings on the MVTec 3D-AD dataset while achieving faster inference and occupying less memory than previous multimodal AD methods. Moreover, we propose a layer-pruning technique to improve memory and time efficiency with a marginal sacrifice in performance.
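The idea described in the abstract (learn a mapping between modalities on nominal samples, then flag locations where observed and mapped features disagree) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the features are synthetic random vectors rather than outputs of real RGB and point-cloud encoders, the mapping is a linear least-squares fit standing in for whatever learned mapping the paper uses, the discrepancy is a Euclidean distance, the two directions are combined by a product, and the function names `fit_linear_map`, `apply_map`, and `anomaly_scores` are hypothetical.

```python
import numpy as np


def fit_linear_map(src, dst):
    """Least-squares fit of dst ~ [src, 1] @ W (stand-in for a learned mapping)."""
    src_h = np.hstack([src, np.ones((src.shape[0], 1))])  # append a bias column
    W, *_ = np.linalg.lstsq(src_h, dst, rcond=None)
    return W


def apply_map(W, src):
    """Map source-modality features into the other modality's feature space."""
    src_h = np.hstack([src, np.ones((src.shape[0], 1))])
    return src_h @ W


def anomaly_scores(W_2d_to_3d, W_3d_to_2d, feat_2d, feat_3d):
    """Per-location score: product of the two crossmodal discrepancies (assumed)."""
    err_3d = np.linalg.norm(apply_map(W_2d_to_3d, feat_2d) - feat_3d, axis=1)
    err_2d = np.linalg.norm(apply_map(W_3d_to_2d, feat_3d) - feat_2d, axis=1)
    return err_2d * err_3d


rng = np.random.default_rng(0)
dim = 6

# Synthetic "nominal" relation between the modalities: a fixed, well-conditioned
# linear map plus small noise. Real features would come from pretrained encoders.
A = np.eye(dim) + 0.1 * rng.normal(size=(dim, dim))
nominal_2d = rng.normal(size=(500, dim))
nominal_3d = nominal_2d @ A + 0.01 * rng.normal(size=(500, dim))

# Training uses nominal samples only, one mapping per direction.
W_2d_to_3d = fit_linear_map(nominal_2d, nominal_3d)
W_3d_to_2d = fit_linear_map(nominal_3d, nominal_2d)

# Test sample: 100 locations, with the 3D feature at location 42 corrupted
# so that it no longer agrees with its 2D counterpart.
test_2d = rng.normal(size=(100, dim))
test_3d = test_2d @ A + 0.01 * rng.normal(size=(100, dim))
test_3d[42] += 3.0

scores = anomaly_scores(W_2d_to_3d, W_3d_to_2d, test_2d, test_3d)
most_anomalous = int(np.argmax(scores))
print("most anomalous location:", most_anomalous)
```

In this toy setup the corrupted location receives a far larger score than the rest, because the mappings fitted on nominal data can no longer predict one modality's feature from the other there. Combining the two directions by a product is one possible design choice; it keeps the score low unless both directions disagree.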

Citation History

Jan 27, 2026: 45
Feb 13, 2026: 46 (+1)
Feb 13, 2026: 46