On the Thermodynamic Interpretation of Deep Learning Systems

Fioresi, R.; Faglioni, F.; Morri, F.; Squadrani, L.

doi:10.1007/978-3-030-80209-7_97

In the study of the time evolution of the parameters of Deep Learning systems optimized via stochastic gradient descent (SGD), temperature, entropy and other thermodynamic notions are commonly employed in order to exploit the Boltzmann formalism. We show that, in simulations on popular datasets (CIFAR10, MNIST), such simplified models appear inadequate: different regions of the parameter space exhibit significantly different temperatures, and no elementary function expresses the temperature in terms of learning rate and batch size, as is commonly assumed. This suggests a more conceptual approach involving contact dynamics and Lie Group Thermodynamics.
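
The elementary relation that the abstract describes as commonly assumed is usually a proportionality between temperature and the ratio of learning rate to batch size. The sketch below is not the authors' experiment. It is a hypothetical one-dimensional toy model, with a quadratic loss of curvature `k` and Gaussian per-sample gradient noise of variance `sigma2` (both are assumptions introduced here for illustration), in which such a relation does hold. It may help make concrete the kind of formula that the paper reports as inadequate on CIFAR10 and MNIST.

```python
import random


def predicted_temperature(lr, batch_size, sigma2=1.0, k=1.0):
    """Closed-form stationary temperature of the toy model.

    Toy SGD update (an assumption, not the paper's setup):
        theta <- theta - lr * (k * theta + xi),  xi ~ N(0, sigma2 / batch_size)
    The stationary variance is lr * sigma2 / (batch_size * k * (2 - lr * k)).
    Matching it to a Boltzmann distribution exp(-k * theta**2 / (2 * T))
    gives T = k * Var(theta).
    """
    return lr * sigma2 / (batch_size * (2.0 - lr * k))


def simulated_temperature(lr, batch_size, sigma2=1.0, k=1.0,
                          steps=200_000, seed=0):
    """Estimate T = k * E[theta**2] by running the toy SGD recursion."""
    rng = random.Random(seed)
    noise_std = (sigma2 / batch_size) ** 0.5  # mini-batch averaging shrinks noise
    theta = 0.0
    burn_in = steps // 10                     # discard the initial transient
    total_sq = 0.0
    for t in range(steps):
        grad = k * theta + rng.gauss(0.0, noise_std)
        theta -= lr * grad
        if t >= burn_in:
            total_sq += theta * theta
    return k * total_sq / (steps - burn_in)


if __name__ == "__main__":
    for lr, batch_size in [(0.1, 10), (0.1, 20), (0.05, 10)]:
        print(lr, batch_size,
              predicted_temperature(lr, batch_size),
              simulated_temperature(lr, batch_size))
```

For small `lr * k` the closed form reduces to `lr * sigma2 / (2 * batch_size)`, so in this toy setting the temperature depends on the two hyperparameters only through their ratio. The paper's claim is that no such elementary dependence describes the temperatures measured in actual networks.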

Fioresi R., Faglioni F., Morri F., Squadrani L. (2021). On the Thermodynamic Interpretation of Deep Learning Systems [10.1007/978-3-030-80209-7_97].



Year: 2021
Volume title: Lecture Notes in Artificial Intelligence
First page: 909
Last page: 917
DOI: https://dx.doi.org/10.1007/978-3-030-80209-7_97
Publication type: 4.01 Contribution in conference proceedings


Use this identifier to cite or link to this document: https://hdl.handle.net/11585/861355
