CRIS Current Research Information System

Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.

Bassi P.R.A.S., Dertkigil S.S.J., Cavalli A. (2024). Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization. NATURE COMMUNICATIONS, 15(1), 291-306 [10.1038/s41467-023-44371-z].

Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization

Bassi P. R. A. S.;Dertkigil S. S. J.;Cavalli A.

2024

Abstract

Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Rivista
	
				NATURE COMMUNICATIONS
			
	Codice DOI
	
				https://dx.doi.org/10.1038/s41467-023-44371-z
			
	Citazione
	
				Bassi P.R.A.S.,  Dertkigil S.S.J.,  Cavalli A. (2024). Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization. NATURE COMMUNICATIONS, 15(1), 291-306 [10.1038/s41467-023-44371-z].
			
	Tutti gli autori
	
						Bassi P.R.A.S.; Dertkigil S.S.J.; Cavalli A.
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Nat_Commun.pdf accesso aperto Tipo: Versione (PDF) editoriale Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 4.11 MB Formato Adobe PDF Visualizza/Apri	4.11 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/966525

Citazioni

2

11

10

social impact