Quality inspection tasks, i.e., anomaly detection, localization and classification, face the scarcity of non-nominal images in real industrial scenarios. Hence, generative models have been explored as a tool to obtain defective images from few real labelled samples. Despite the fast-increasing quality of such models, generating realistic defective images remains a challenging task due to the same data scarcity problem, which makes it difficult to steer large general-purpose models to produce realistic defects for specific industrial products. In this paper, we show how casting defect generation as inpainting of nominal images and using ControlNet to specialize a state-of-the-art inpainting model based on stable diffusion can be an effective solution for the few-shot anomaly generation task. Extensive experimental results on the MVTec-AD dataset demonstrate that the high quality of the images generated by our method significantly improves the state of the art on downstream anomaly classification.
Ali, M., Fioraio, N., Salti, S., Di Stefano, L. (2024). AnomalyControl: Few-Shot Anomaly Generation by ControlNet Inpainting. IEEE ACCESS, 12, 192903-192914 [10.1109/access.2024.3520002].
AnomalyControl: Few-Shot Anomaly Generation by ControlNet Inpainting
Ali, Musawar
Primo
;Salti, SamuelePenultimo
;Di Stefano, LuigiUltimo
2024
Abstract
Quality inspection tasks, i.e., anomaly detection, localization and classification, face the scarcity of non-nominal images in real industrial scenarios. Hence, generative models have been explored as a tool to obtain defective images from few real labelled samples. Despite the fast-increasing quality of such models, generating realistic defective images remains a challenging task due to the same data scarcity problem, which makes it difficult to steer large general-purpose models to produce realistic defects for specific industrial products. In this paper, we show how casting defect generation as inpainting of nominal images and using ControlNet to specialize a state-of-the-art inpainting model based on stable diffusion can be an effective solution for the few-shot anomaly generation task. Extensive experimental results on the MVTec-AD dataset demonstrate that the high quality of the images generated by our method significantly improves the state of the art on downstream anomaly classification.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.