
Neural network distillation on IoT platforms for sound event detection

Cerutti G.; Prasad R.; Brutti A.; Farella E.
2019

Abstract

In most classification tasks, wide and deep neural networks perform and generalize better than their smaller counterparts, in particular when they are exposed to large and heterogeneous training sets. However, in the emerging field of the Internet of Things, memory footprint and energy budget pose severe limits on the size and complexity of the neural models that can be implemented on embedded devices. The Student-Teacher approach is an attractive strategy to distill knowledge from a large network into smaller ones that can fit on low-energy, low-complexity embedded IoT platforms. In this paper, we consider the outdoor sound event detection task as a use case. Building upon the VGGish network, we investigate different distillation strategies to substantially reduce the classifier's size and computational cost with minimal performance loss. Experiments on the UrbanSound8K dataset show that extreme compression factors (up to 4.2 · 10⁻⁴ for parameters and 1.2 · 10⁻³ for operations with respect to VGGish) can be achieved while limiting the accuracy drop from 75% to 70%. Finally, we compare different embedded platforms to analyze the trade-off between available resources and achievable accuracy.
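The abstract describes a standard Student-Teacher setup: a large teacher (VGGish) provides soft targets that guide the training of a much smaller student. As a point of reference only, below is a minimal sketch of the classic soft-target distillation loss (Hinton et al., 2015) in PyTorch; the paper compares several distillation strategies, and this sketch does not claim to reproduce any of them. The temperature and alpha values are illustrative assumptions, not values taken from the paper.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=3.0, alpha=0.5):
    # Soften teacher and student distributions with the same temperature.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between the softened distributions; the T^2 factor
    # keeps gradient magnitudes comparable across temperatures.
    soft_loss = F.kl_div(log_soft_student, soft_teacher,
                         reduction="batchmean") * temperature ** 2
    # Standard cross-entropy on the ground-truth (hard) labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    # Weighted combination of the distillation and supervised objectives.
    return alpha * soft_loss + (1.0 - alpha) * hard_loss

For scale, VGGish has roughly 72 million parameters, so the reported parameter compression factor of 4.2 · 10⁻⁴ corresponds to a student on the order of 30 thousand parameters.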
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3609–3613

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/800136

Citations
  • Scopus: 20