CRIS Current Research Information System

Federated Learning (FL) has emerged as a key paradigm in machine learning but its performance often deteriorates under non-independent and identically distributed (non-IID) client data. Such heterogeneity frequently reflects geographic factors—for example, regional linguistic variations or localized traffic patterns—leading to IID data within regions but with non-IID distributions across them. However, existing FL algorithms are typically evaluated by randomly splitting non-IID data across devices, disregarding their spatial distribution. To address this gap, we introduce PROFED, a benchmark that simulates data splits with varying degrees of skewness across different regions. We incorporate several skewness methods from the literature and apply them to well-known datasets, including MNIST, FashionMNIST, Extended MNIST, CIFAR-10, CIFAR-100, and UTKFace. Our goal is to provide researchers with a standardized framework to evaluate FL algorithms more effectively and consistently against established baselines.

Domini, D., Ingemann, C.O., Aguzzi, G., Esterle, L., Viroli, M. (2026). ProFed: A Benchmark for Proximity-Based Non-IID Federated Learning. JOURNAL OF OPEN RESEARCH SOFTWARE, 14, 1-13 [10.5334/jors.624].

ProFed: A Benchmark for Proximity-Based Non-IID Federated Learning

Domini, Davide;Ingemann, Christian Otte;Aguzzi, Gianluca;Esterle, Lukas;Viroli, Mirko

2026

Abstract

Federated Learning (FL) has emerged as a key paradigm in machine learning but its performance often deteriorates under non-independent and identically distributed (non-IID) client data. Such heterogeneity frequently reflects geographic factors—for example, regional linguistic variations or localized traffic patterns—leading to IID data within regions but with non-IID distributions across them. However, existing FL algorithms are typically evaluated by randomly splitting non-IID data across devices, disregarding their spatial distribution. To address this gap, we introduce PROFED, a benchmark that simulates data splits with varying degrees of skewness across different regions. We incorporate several skewness methods from the literature and apply them to well-known datasets, including MNIST, FashionMNIST, Extended MNIST, CIFAR-10, CIFAR-100, and UTKFace. Our goal is to provide researchers with a standardized framework to evaluate FL algorithms more effectively and consistently against established baselines.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2026
			
	Rivista
	
				JOURNAL OF OPEN RESEARCH SOFTWARE
			
	Codice DOI
	
				https://dx.doi.org/10.5334/jors.624
			
	Citazione
	
				Domini, D., Ingemann, C.O., Aguzzi, G., Esterle, L., Viroli, M. (2026). ProFed: A Benchmark for Proximity-Based Non-IID Federated Learning. JOURNAL OF OPEN RESEARCH SOFTWARE, 14, 1-13 [10.5334/jors.624].
			
	Tutti gli autori
	
						Domini, Davide; Ingemann, Christian Otte; Aguzzi, Gianluca; Esterle, Lukas; Viroli, Mirko
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
69a5934e14106.pdf accesso aperto Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 2.05 MB Formato Adobe PDF Visualizza/Apri	2.05 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1052410

Citazioni

ND

ND

ND

ND

social impact