CRIS Current Research Information System

Constrained deep neural network architecture search for IoT devices accounting for hardware calibration

Scheidegger, F; Benini, L; Bekas, C; Malossi, C

2019

Abstract

Deep neural networks achieve outstanding results for challenging image classification tasks. However, the design of network topologies is a complex task, and the research community is conducting ongoing efforts to discover top-accuracy topologies, either manually or by employing expensive architecture searches. We propose a unique narrow-space architecture search that focuses on delivering low-cost and rapidly executing networks that respect the strict memory and time requirements typical of Internet-of-Things (IoT) near-sensor computing platforms. Our approach provides solutions with classification latencies below 10 ms running on a low-cost device with 1 GB RAM and a peak performance of 5.6 GFLOPS. The narrow-space search of floating-point models improves the accuracy on CIFAR10 of an established IoT model from 70.64% to 74.87% within the same memory constraints. We further improve the accuracy to 82.07% by including 16-bit half types and obtain the highest accuracy of 83.45% by extending the search with model-optimized IEEE 754 reduced types. To the best of our knowledge, this is the first empirical demonstration of more than 3000 trained models that run with reduced precision and push the Pareto optimal front by a wide margin. Within a given memory constraint, accuracy is improved by more than 7 percentage points for half and by more than 1 further percentage point for the best individual model format.
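
The selection step described in the abstract (keep only models that meet memory and latency limits, then look at the accuracy/memory Pareto front) can be illustrated with a minimal sketch. This is not the authors' search procedure: the `Candidate` class, the helper names, the limits, and all numbers below are hypothetical and chosen only to show how storing weights in 16 bits instead of 32 lets a larger model fit the same memory budget.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A trained model candidate (all values here are made-up examples)."""
    name: str
    num_params: int     # number of weights
    bits: int           # storage width per weight, e.g. 32 or 16
    latency_ms: float   # measured classification latency on the target device
    accuracy: float     # validation accuracy in percent

    @property
    def memory_mb(self) -> float:
        # Weight memory only: parameters * bytes per parameter.
        return self.num_params * self.bits / 8 / 1e6


def feasible(cands, max_memory_mb, max_latency_ms):
    """Drop candidates that violate the memory or latency constraint."""
    return [c for c in cands
            if c.memory_mb <= max_memory_mb and c.latency_ms <= max_latency_ms]


def pareto_front(cands):
    """Keep candidates that no other candidate beats on both memory and accuracy."""
    front, best_acc = [], float("-inf")
    # Scan from smallest to largest memory; on ties, the more accurate one first.
    for c in sorted(cands, key=lambda c: (c.memory_mb, -c.accuracy)):
        if c.accuracy > best_acc:
            front.append(c)
            best_acc = c.accuracy
    return front


# Hypothetical candidates, not results from the paper.
candidates = [
    Candidate("fp32_small", 1_000_000, 32, 6.0, 70.0),
    Candidate("fp32_wide", 2_000_000, 32, 9.0, 74.0),
    Candidate("fp16_wide", 2_000_000, 16, 5.0, 74.0),
    Candidate("fp16_deep", 4_000_000, 16, 8.5, 80.0),
    Candidate("fp16_huge", 8_000_000, 16, 14.0, 85.0),  # too slow and too large
]

front = pareto_front(feasible(candidates, max_memory_mb=10.0, max_latency_ms=10.0))
for c in front:
    print(f"{c.name}: {c.memory_mb:.1f} MB, {c.accuracy:.1f}%")
```

In this toy data the 16-bit candidates dominate the 32-bit ones at equal memory, which mirrors the qualitative claim of the abstract; the actual gains reported there come from the authors' trained models, not from this sketch.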

Year	2019
Volume title	Advances in Neural Information Processing Systems 32 (NIPS 2019)
First page	1
Last page	11
Series	Advances in Neural Information Processing Systems
Citation	Scheidegger, F., Benini, L., Bekas, C., Malossi, C. (2019). Constrained deep neural network architecture search for IoT devices accounting for hardware calibration. 10010 North Torrey Pines Rd, La Jolla, California 92037, USA: Neural Information Processing Systems (NIPS).
All authors	Scheidegger, F; Benini, L; Bekas, C; Malossi, C
Appears in types	4.01 Contribution in conference proceedings

Files in this record:

File	Access	Description	Type	License	Size	Format
Postprint_Constrained deep neural network architecture.pdf	Open access	Postprint article	Postprint / Author's Accepted Manuscript (AAM), the version accepted for publication after peer review	Free open-access license	1.1 MB	Adobe PDF
Constrained deep neural network architecture search for IoT.pdf	Open access	Publisher's version	Publisher's version (PDF) / Version of Record	Free open-access license	783.04 kB	Adobe PDF
main_appendix_v004.pdf	Open access		Supplementary file	Free open-access license	327.79 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/767350
