The current trend of over-parameterized Deep Neural Networks makes the deployment on resource constrained systems challenging. To deal with this, optimization techniques, such as network pruning, can be adopted. We propose a novel pruning technique based on trainable probability masks that, when binarized, select the elements of the network to prune. Our method features i) an automatic selections of the elements to prune by jointly training the binary masks with the model, ii) the capability of controlling the pruning level through hyper-parameters of a novel regularization term. We assess the effectiveness of our method by employing it in the structured pruning of the fully connected layers of shallow and deep neural networks where it outperforms the magnitude-based pruning approaches

Martinini, F., Enttsel, A., Marchioni, A., Mangia, M., Rovatti, R., Setti, G. (2023). Structured Pruning in Deep Neural Networks with Trainable Probability Masks. New York : IEEE [10.1109/mwscas57524.2023.10405945].

Structured Pruning in Deep Neural Networks with Trainable Probability Masks

Martinini, F.;Enttsel, A.;Marchioni, A.;Mangia, M.;Rovatti, R.;
2023

Abstract

The current trend of over-parameterized Deep Neural Networks makes the deployment on resource constrained systems challenging. To deal with this, optimization techniques, such as network pruning, can be adopted. We propose a novel pruning technique based on trainable probability masks that, when binarized, select the elements of the network to prune. Our method features i) an automatic selections of the elements to prune by jointly training the binary masks with the model, ii) the capability of controlling the pruning level through hyper-parameters of a novel regularization term. We assess the effectiveness of our method by employing it in the structured pruning of the fully connected layers of shallow and deep neural networks where it outperforms the magnitude-based pruning approaches
2023
2023 IEEE 66th International Midwest Symposium on Circuits and Systems (MWSCAS)
1020
1024
Martinini, F., Enttsel, A., Marchioni, A., Mangia, M., Rovatti, R., Setti, G. (2023). Structured Pruning in Deep Neural Networks with Trainable Probability Masks. New York : IEEE [10.1109/mwscas57524.2023.10405945].
Martinini, F.; Enttsel, A.; Marchioni, A.; Mangia, M.; Rovatti, R.; Setti, G.
File in questo prodotto:
File Dimensione Formato  
MWCAS2023_pruning.pdf

accesso aperto

Tipo: Postprint
Licenza: Licenza per accesso libero gratuito
Dimensione 782.74 kB
Formato Adobe PDF
782.74 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/964800
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact