Latent replay for real-time continual learning

Pellegrini, L.; Graffieti, G.; Lomonaco, V.; Maltoni, D.

doi:10.1109/IROS45743.2020.9341460

Training deep neural networks at the edge on light computational devices, embedded systems and robotic platforms is nowadays very challenging. Continual learning techniques, where complex models are incrementally trained on small batches of new data, can make the learning problem tractable even for CPU-only embedded devices enabling remarkable levels of adaptiveness and autonomy. However, a number of practical problems need to be solved: catastrophic forgetting before anything else. In this paper we introduce an original technique named "Latent Replay"where, instead of storing a portion of past data in the input space, we store activations volumes at some intermediate layer. This can significantly reduce the computation and storage required by native rehearsal. To keep the representation stable and the stored activations valid we propose to slow-down learning at all the layers below the latent replay one, leaving the layers above free to learn at full pace. In our experiments we show that Latent Replay, combined with existing continual learning techniques, achieves state-of-the-art performance on complex video benchmarks such as CORe50 NICv2 (with nearly 400 small and highly non-i.i.d. batches) and OpenLORIS. Finally, we demonstrate the feasibility of nearly real-time continual learning on the edge through the deployment of the proposed technique on a smartphone device.

Pellegrini L., Graffieti G., Lomonaco V., Maltoni D. (2020). Latent replay for real-time continual learning. Institute of Electrical and Electronics Engineers Inc. [10.1109/IROS45743.2020.9341460].

Latent replay for real-time continual learning

Pellegrini L.^Primo;Graffieti G.^Secondo;Lomonaco V.^Penultimo;Maltoni D.^Ultimo

2020

Abstract

Training deep neural networks at the edge on light computational devices, embedded systems and robotic platforms is nowadays very challenging. Continual learning techniques, where complex models are incrementally trained on small batches of new data, can make the learning problem tractable even for CPU-only embedded devices enabling remarkable levels of adaptiveness and autonomy. However, a number of practical problems need to be solved: catastrophic forgetting before anything else. In this paper we introduce an original technique named "Latent Replay"where, instead of storing a portion of past data in the input space, we store activations volumes at some intermediate layer. This can significantly reduce the computation and storage required by native rehearsal. To keep the representation stable and the stored activations valid we propose to slow-down learning at all the layers below the latent replay one, leaving the layers above free to learn at full pace. In our experiments we show that Latent Replay, combined with existing continual learning techniques, achieves state-of-the-art performance on complex video benchmarks such as CORe50 NICv2 (with nearly 400 small and highly non-i.i.d. batches) and OpenLORIS. Finally, we demonstrate the feasibility of nearly real-time continual learning on the edge through the deployment of the proposed technique on a smartphone device.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2020
			
	Titolo del volume
	
				IEEE International Conference on Intelligent Robots and Systems
			
	Pagina iniziale
	
				10203
			
	Pagina finale
	
				10209
			
	Collana/Serie
	
				PROCEEDINGS OF THE ... IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS
			
	Codice DOI
	
				https://dx.doi.org/10.1109/IROS45743.2020.9341460
			
	Citazione
	
				Pellegrini L.,  Graffieti G.,  Lomonaco V.,  Maltoni D. (2020). Latent replay for real-time continual learning. Institute of Electrical and Electronics Engineers Inc. [10.1109/IROS45743.2020.9341460].
			
	Tutti gli autori
	
						Pellegrini L.; Graffieti G.; Lomonaco V.; Maltoni D.
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/834400

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

131

102

CRIS Current Research Information System