Stella Nera: A Differentiable Maddness-Based Hardware Accelerator for Efficient Approximate Matrix Multiplication

Schönleber, Jannis; Cavigelli, Lukas; Perotti, Matteo; Benini, Luca; Andri, Renzo

doi:10.1109/isvlsi65124.2025.11130225

Artificial intelligence has surged in recent years, with advancements in machine learning rapidly impacting nearly every area of life. However, the growing complexity of these models has far outpaced advancements in available hardware accelerators, leading to significant computational and energy demands, primarily due to matrix multiplications, which dominate the compute workload. Maddness (i.e., Multiply-ADDitioN-lESS) presents a hash-based version of product quantization, which renders matrix multiplications into lookups and additions, eliminating the need for multipliers entirely. We present Stella Nera1, the first MADDNESS-based accelerator achieving an energy efficiency of 161 TOp/s/[email protected], 25x better than conventional MatMul accelerators due to its small components and reduced computational complexity. We further enhance Maddness with a differentiable approximation, allowing for gradient-based fine-tuning and achieving an end-to-end performance of 92.5% Top-1 accuracy on CIFAR-10.1Named after an album of the famous Swiss musicians Patent Ochsner

Schönleber, J., Cavigelli, L., Perotti, M., Benini, L., Andri, R. (2025). Stella Nera: A Differentiable Maddness-Based Hardware Accelerator for Efficient Approximate Matrix Multiplication. 345 E 47TH ST, NEW YORK, NY 10017 USA : IEEE Computer Society [10.1109/isvlsi65124.2025.11130225].

Stella Nera: A Differentiable Maddness-Based Hardware Accelerator for Efficient Approximate Matrix Multiplication

Schönleber, Jannis;Cavigelli, Lukas;Perotti, Matteo;Benini, Luca;Andri, Renzo

2025

Abstract

Artificial intelligence has surged in recent years, with advancements in machine learning rapidly impacting nearly every area of life. However, the growing complexity of these models has far outpaced advancements in available hardware accelerators, leading to significant computational and energy demands, primarily due to matrix multiplications, which dominate the compute workload. Maddness (i.e., Multiply-ADDitioN-lESS) presents a hash-based version of product quantization, which renders matrix multiplications into lookups and additions, eliminating the need for multipliers entirely. We present Stella Nera1, the first MADDNESS-based accelerator achieving an energy efficiency of 161 TOp/s/[email protected], 25x better than conventional MatMul accelerators due to its small components and reduced computational complexity. We further enhance Maddness with a differentiable approximation, allowing for gradient-based fine-tuning and achieving an end-to-end performance of 92.5% Top-1 accuracy on CIFAR-10.1Named after an album of the famous Swiss musicians Patent Ochsner

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Titolo del volume
	
				Proceedings of IEEE Computer Society Annual Symposium on VLSI, ISVLSI
			
	Pagina iniziale
	
				1
			
	Pagina finale
	
				6
			
	Codice DOI
	
				https://dx.doi.org/10.1109/isvlsi65124.2025.11130225
			
	Citazione
	
				Schönleber, J., Cavigelli, L., Perotti, M., Benini, L., Andri, R. (2025). Stella Nera: A Differentiable Maddness-Based Hardware Accelerator for Efficient Approximate Matrix Multiplication. 345 E 47TH ST, NEW YORK, NY 10017 USA : IEEE Computer Society [10.1109/isvlsi65124.2025.11130225].
			
	Tutti gli autori
	
						Schönleber, Jannis; Cavigelli, Lukas; Perotti, Matteo; Benini, Luca; Andri, Renzo

File in questo prodotto:

Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1040836

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

0

0

ND

CRIS Current Research Information System