CRIS Current Research Information System

Implementing embedded neural network processing at the edge requires efficient hardware acceleration that combines high computational throughput with low power consumption. Driven by the rapid evolution of network architectures and their algorithmic features, accelerator designs are constantly being adapted to support the improved functionalities. Hardware designers can refer to a myriad of accelerator implementations in the literature to evaluate and compare hardware design choices. However, the sheer number of publications and their diverse optimization directions hinder an effective assessment. Existing surveys provide an overview of these works but are often limited to system-level and benchmark-specific performance metrics, making it difficult to quantitatively compare the individual effects of each utilized optimization technique. This complicates the evaluation of optimizations for new accelerator designs, slowing-down the research progress. In contrast to previous surveys, this work provides a quantitative overview of neural network accelerator optimization approaches that have been used in recent works and reports their individual effects on edge processing performance. The list of optimizations and their quantitative effects are presented as a construction kit, allowing to assess the design choices for each building block individually. Reported optimizations range from up to 10,000× memory savings to 33× energy reductions, providing chip designers with an overview of design choices for implementing efficient low power neural network accelerators.

P. Jokic, E. Azarkhish, A. Bonetti, M. Pons, S. Emery, L. Benini (2022). A Construction Kit for Efficient Low Power Neural Network Accelerator Designs. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 21(5), 1-36 [10.1145/3520127].

A Construction Kit for Efficient Low Power Neural Network Accelerator Designs

P. Jokic;E. Azarkhish;A. Bonetti;M. Pons;S. Emery;L. Benini

2022

Abstract

Implementing embedded neural network processing at the edge requires efficient hardware acceleration that combines high computational throughput with low power consumption. Driven by the rapid evolution of network architectures and their algorithmic features, accelerator designs are constantly being adapted to support the improved functionalities. Hardware designers can refer to a myriad of accelerator implementations in the literature to evaluate and compare hardware design choices. However, the sheer number of publications and their diverse optimization directions hinder an effective assessment. Existing surveys provide an overview of these works but are often limited to system-level and benchmark-specific performance metrics, making it difficult to quantitatively compare the individual effects of each utilized optimization technique. This complicates the evaluation of optimizations for new accelerator designs, slowing-down the research progress. In contrast to previous surveys, this work provides a quantitative overview of neural network accelerator optimization approaches that have been used in recent works and reports their individual effects on edge processing performance. The list of optimizations and their quantitative effects are presented as a construction kit, allowing to assess the design choices for each building block individually. Reported optimizations range from up to 10,000× memory savings to 33× energy reductions, providing chip designers with an overview of design choices for implementing efficient low power neural network accelerators.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Rivista
	
				ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS
			
	Codice DOI
	
				https://dx.doi.org/10.1145/3520127
			
	Citazione
	
				P. Jokic,  E. Azarkhish,  A. Bonetti,  M. Pons,  S. Emery,  L. Benini (2022). A Construction Kit for Efficient Low Power Neural Network Accelerator Designs. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 21(5), 1-36 [10.1145/3520127].
			
	Tutti gli autori
	
						P. Jokic; E. Azarkhish; A. Bonetti; M. Pons; S. Emery; L. Benini
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
A Construction Kit for Efficient Low Power Neural Network Accelerator Designs.pdf accesso aperto Descrizione: versione editoriale Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Creative commons Dimensione 2.08 MB Formato Adobe PDF Visualizza/Apri	2.08 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/907565

Citazioni

ND

4

2

social impact