

Human Activity Recognition on Microcontrollers with Quantized and Adaptive Deep Neural Networks

Daghero, F; Burrello, A; Xie, C; Castellano, M; Gandolfi, L; Calimera, A; Macii, E; Poncino, M; Pagliari, DJ

2022

Abstract

Human Activity Recognition (HAR) based on inertial data is an increasingly widespread task on embedded devices, from smartphones to ultra-low-power sensors. Due to the high computational complexity of deep learning models, most embedded HAR systems are based on simple and not-so-accurate classic machine learning algorithms. This work bridges the gap between on-device HAR and deep learning, proposing a set of efficient one-dimensional Convolutional Neural Networks (CNNs) that can be deployed on general-purpose microcontrollers (MCUs). Our CNNs are obtained by combining hyper-parameter optimization with sub-byte and mixed-precision quantization, to find good trade-offs between classification results and memory occupation. Moreover, we leverage adaptive inference as an orthogonal optimization to tune the inference complexity at runtime based on the processed input, hence producing a more flexible HAR system. With experiments on four datasets, and targeting an ultra-low-power RISC-V MCU, we show that (i) we are able to obtain a rich set of Pareto-optimal CNNs for HAR, spanning more than one order of magnitude in terms of memory, latency, and energy consumption; (ii) thanks to adaptive inference, we can derive more than 20 runtime operating modes starting from a single CNN, differing by up to 10% in classification scores and by more than 3x in inference complexity, with a limited memory overhead; (iii) on three of the four benchmarks, we outperform all previous deep learning methods, while reducing the memory occupation by more than 100x; the few methods that obtain better performance (both shallow and deep) are not compatible with MCU deployment; (iv) all our CNNs are compatible with real-time on-device HAR, achieving an inference latency that ranges between 9 µs and 16 ms. Their memory occupation ranges from 0.05 to 23.17 kB, and their energy consumption from 0.05 to 61.59 µJ, allowing years of continuous operation on a small battery supply.
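
The abstract does not spell out how adaptive inference is implemented, so the following is only a minimal sketch of one common form of the idea: a two-stage cascade in which a cheap "little" classifier handles inputs it is confident about and a costlier "big" classifier is invoked otherwise. The confidence measure (the gap between the two largest class probabilities), the function names, and the toy classifiers are all assumptions made for illustration and are not taken from the paper. Sweeping `threshold` yields different accuracy/complexity operating points from the same pair of models.

```python
import math
from typing import Callable, List, Sequence, Tuple

# Hypothetical type: a classifier maps one sensor window to class logits.
Classifier = Callable[[Sequence[float]], List[float]]


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def score_margin(probs: Sequence[float]) -> float:
    """Gap between the two largest class probabilities (assumed confidence measure)."""
    top = sorted(probs, reverse=True)
    return top[0] - top[1]


def adaptive_predict(
    window: Sequence[float],
    little: Classifier,
    big: Classifier,
    threshold: float,
) -> Tuple[int, bool]:
    """Return (predicted class, whether the big model was used)."""
    probs = softmax(little(window))
    if score_margin(probs) >= threshold:
        # The little model is confident enough: stop early.
        return probs.index(max(probs)), False
    # Otherwise fall back to the more expensive model.
    probs = softmax(big(window))
    return probs.index(max(probs)), True


# Toy stand-ins for real CNNs, for illustration only.
def toy_little(window: Sequence[float]) -> List[float]:
    mean = sum(window) / len(window)
    return [mean, -mean]


def toy_big(window: Sequence[float]) -> List[float]:
    energy = sum(x * x for x in window) / len(window)
    return [energy - 0.5, 0.5 - energy]


if __name__ == "__main__":
    for window in ([2.0, 2.0], [0.1, -0.1]):
        label, used_big = adaptive_predict(window, toy_little, toy_big, threshold=0.5)
        print(window, "-> class", label, "| big model used:", used_big)
```

In this sketch, a threshold of 0 never invokes the big model, while a threshold above 1 always does; intermediate values trade average inference cost against how often the less accurate model has the final say.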


Year	2022
Journal	ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS
DOI	https://dx.doi.org/10.1145/3542819
Citation	Daghero, F., Burrello, A., Xie, C., Castellano, M., Gandolfi, L., Calimera, A., et al. (2022). Human Activity Recognition on Microcontrollers with Quantized and Adaptive Deep Neural Networks. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 21(4), 1-28 [10.1145/3542819].
All authors	Daghero, F; Burrello, A; Xie, C; Castellano, M; Gandolfi, L; Calimera, A; Macii, E; Poncino, M; Pagliari, DJ
Appears in types	1.01 Journal article

Files in this item:

File	Size	Format
3542819.pdf (open access; type: publisher's version (PDF); license: Creative Commons)	4.34 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/900489
