A 1036 TOp/s/W, 12.2 mW, 2.72 mu J/Inference All Digital TNN Accelerator in 22 nm FDX Technology for TinyML Applications

Scherer, M; Di Mauro, A; Rutishauser, G; Fischer, T; Benini, L

doi:10.1109/COOLCHIPS54332.2022.9772668

Tiny Machine Learning (TinyML) applications impose mu J/Inference constraints, with maximum power consumption of a few tens of mW. It is extremely challenging to meet these requirement at a reasonable accuracy level. In this work, we address this challenge with a flexible, fully digital Ternary Neural Network (TNN) accelerator in a RISC-V-based SoC. The design achieves 2.72 mu J/Inference, 12.2 mW, 3200 Inferences/sec at 0.5 V for a non-trivial 9-layer, 96 channels-per-layer network with CIFAR-10 accuracy of 86 %. The peak energy efficiency is 1036 TOp/s/W, outperforming the state-of-the-art in silicon-proven TinyML accelerators by 1.67x.

Scherer, M., Di Mauro, A., Rutishauser, G., Fischer, T., Benini, L. (2022). A 1036 TOp/s/W, 12.2 mW, 2.72 mu J/Inference All Digital TNN Accelerator in 22 nm FDX Technology for TinyML Applications. 345 E 47TH ST, NEW YORK, NY 10017 USA : IEEE [10.1109/COOLCHIPS54332.2022.9772668].

A 1036 TOp/s/W, 12.2 mW, 2.72 mu J/Inference All Digital TNN Accelerator in 22 nm FDX Technology for TinyML Applications

Scherer, M;Di Mauro, A;Rutishauser, G;Fischer, T;Benini, L

2022

Abstract

Tiny Machine Learning (TinyML) applications impose mu J/Inference constraints, with maximum power consumption of a few tens of mW. It is extremely challenging to meet these requirement at a reasonable accuracy level. In this work, we address this challenge with a flexible, fully digital Ternary Neural Network (TNN) accelerator in a RISC-V-based SoC. The design achieves 2.72 mu J/Inference, 12.2 mW, 3200 Inferences/sec at 0.5 V for a non-trivial 9-layer, 96 channels-per-layer network with CIFAR-10 accuracy of 86 %. The peak energy efficiency is 1036 TOp/s/W, outperforming the state-of-the-art in silicon-proven TinyML accelerators by 1.67x.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Titolo del volume
	
				2022 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)
			
	Pagina iniziale
	
				1
			
	Pagina finale
	
				3
			
	Codice DOI
	
				https://dx.doi.org/10.1109/COOLCHIPS54332.2022.9772668
			
	Citazione
	
				Scherer, M., Di Mauro, A., Rutishauser, G., Fischer, T., Benini, L. (2022). A 1036 TOp/s/W, 12.2 mW, 2.72 mu J/Inference All Digital TNN Accelerator in 22 nm FDX Technology for TinyML Applications. 345 E 47TH ST, NEW YORK, NY 10017 USA : IEEE [10.1109/COOLCHIPS54332.2022.9772668].
			
	Tutti gli autori
	
						Scherer, M; Di Mauro, A; Rutishauser, G; Fischer, T; Benini, L

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/905340

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

6

3

CRIS Current Research Information System