CRIS Current Research Information System

Transfer learning can significantly improve the sample efficiency of neural networks, by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practice often relies on ad-hoc solutions, while theoretical understanding of these procedures is still limited. In the present work, we re-think a solvable model of synthetic data as a framework for modeling correlation between data-sets. This setup allows for an analytic characterization of the generalization performance obtained when transferring the learned feature map from the source to the target task. Focusing on the problem of training two-layer networks in a binary classification setting, we show that our model can capture a range of salient features of transfer learning with real data. Moreover, by exploiting parametric control over the correlation between the two data-sets, we systematically investigate under which conditions the transfer of features is beneficial for generalization.

Gerace, F., Saglietti, L., Sarao Mannelli, S., Saxe, A., Zdeborová, L. (2022). Probing transfer learning with a model of synthetic correlated datasets. MACHINE LEARNING: SCIENCE AND TECHNOLOGY, 3(1), 1-21 [10.1088/2632-2153/ac4f3f].

Probing transfer learning with a model of synthetic correlated datasets

Gerace, Federica;Saglietti, Luca;Sarao Mannelli, Stefano;Saxe, Andrew;Zdeborová, Lenka

2022

Abstract

Transfer learning can significantly improve the sample efficiency of neural networks, by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practice often relies on ad-hoc solutions, while theoretical understanding of these procedures is still limited. In the present work, we re-think a solvable model of synthetic data as a framework for modeling correlation between data-sets. This setup allows for an analytic characterization of the generalization performance obtained when transferring the learned feature map from the source to the target task. Focusing on the problem of training two-layer networks in a binary classification setting, we show that our model can capture a range of salient features of transfer learning with real data. Moreover, by exploiting parametric control over the correlation between the two data-sets, we systematically investigate under which conditions the transfer of features is beneficial for generalization.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Rivista
	
				MACHINE LEARNING: SCIENCE AND TECHNOLOGY
			
	Codice DOI
	
				https://dx.doi.org/10.1088/2632-2153/ac4f3f
			
	Citazione
	
				Gerace, F., Saglietti, L., Sarao Mannelli, S., Saxe, A., Zdeborová, L. (2022). Probing transfer learning with a model of synthetic correlated datasets. MACHINE LEARNING: SCIENCE AND TECHNOLOGY, 3(1), 1-21 [10.1088/2632-2153/ac4f3f].
			
	Tutti gli autori
	
						Gerace, Federica; Saglietti, Luca; Sarao Mannelli, Stefano; Saxe, Andrew; Zdeborová, Lenka
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Gerace_2022_Mach._Learn. _Sci._Technol._3_015030.pdf accesso aperto Tipo: Versione (PDF) editoriale Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 1.38 MB Formato Adobe PDF Visualizza/Apri	1.38 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/969588

Citazioni

ND

11

10

social impact