CRIS Current Research Information System

Recognizing packaged grocery products based solely on appearance is still an open issue for modern computer vision systems due to peculiar challenges. Firstly, the number of different items to be recognized is huge (i.e., in the order of thousands) and rapidly changing over time. Moreover, there exist a significant domain shift between the images that should be recognized at test time, taken in stores by cheap cameras, and those available for training, usually just one or a few studio-quality images per product. We propose an end-to-end architecture comprising a GAN to address the domain shift at training time and a deep CNN trained on the samples generated by the GAN to learn an embedding of product images that enforces a hierarchy between product categories. At test time, we perform recognition by means of K-NN search against a database consisting of just one reference image per product. Experiments addressing recognition of products present in the training datasets as well as different ones unseen at training time show that our approach compares favorably to state-of-the-art methods on the grocery recognition task and generalize fairly well to similar ones.

Tonioni A., Di Stefano L. (2019). Domain invariant hierarchical embedding for grocery products recognition. COMPUTER VISION AND IMAGE UNDERSTANDING, 182, 81-92 [10.1016/j.cviu.2019.03.005].

Domain invariant hierarchical embedding for grocery products recognition

Tonioni A.;Di Stefano L.

2019

Abstract

Recognizing packaged grocery products based solely on appearance is still an open issue for modern computer vision systems due to peculiar challenges. Firstly, the number of different items to be recognized is huge (i.e., in the order of thousands) and rapidly changing over time. Moreover, there exist a significant domain shift between the images that should be recognized at test time, taken in stores by cheap cameras, and those available for training, usually just one or a few studio-quality images per product. We propose an end-to-end architecture comprising a GAN to address the domain shift at training time and a deep CNN trained on the samples generated by the GAN to learn an embedding of product images that enforces a hierarchy between product categories. At test time, we perform recognition by means of K-NN search against a database consisting of just one reference image per product. Experiments addressing recognition of products present in the training datasets as well as different ones unseen at training time show that our approach compares favorably to state-of-the-art methods on the grocery recognition task and generalize fairly well to similar ones.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2019
			
	Rivista
	
				COMPUTER VISION AND IMAGE UNDERSTANDING
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.cviu.2019.03.005
			
	Citazione
	
				Tonioni A.,  Di Stefano L. (2019). Domain invariant hierarchical embedding for grocery products recognition. COMPUTER VISION AND IMAGE UNDERSTANDING, 182, 81-92 [10.1016/j.cviu.2019.03.005].
			
	Tutti gli autori
	
						Tonioni A.; Di Stefano L.
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/737382

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

38

27

ND

social impact