The current trend of over-parameterized Deep Neural Networks makes the deployment on resource constrained systems challenging. To deal with this, optimization techniques, such as network pruning, can be adopted. We propose a novel pruning technique based on trainable probability masks that, when binarized, select the elements of the network to prune. Our method features i) an automatic selections of the elements to prune by jointly training the binary masks with the model, ii) the capability of controlling the pruning level through hyper-parameters of a novel regularization term. We assess the effectiveness of our method by employing it in the structured pruning of the fully connected layers of shallow and deep neural networks where it outperforms the magnitude-based pruning approaches
Martinini, F., Enttsel, A., Marchioni, A., Mangia, M., Rovatti, R., Setti, G. (2023). Structured Pruning in Deep Neural Networks with Trainable Probability Masks. New York : IEEE [10.1109/mwscas57524.2023.10405945].
Structured Pruning in Deep Neural Networks with Trainable Probability Masks
Martinini, F.;Enttsel, A.;Marchioni, A.;Mangia, M.;Rovatti, R.;
2023
Abstract
The current trend of over-parameterized Deep Neural Networks makes the deployment on resource constrained systems challenging. To deal with this, optimization techniques, such as network pruning, can be adopted. We propose a novel pruning technique based on trainable probability masks that, when binarized, select the elements of the network to prune. Our method features i) an automatic selections of the elements to prune by jointly training the binary masks with the model, ii) the capability of controlling the pruning level through hyper-parameters of a novel regularization term. We assess the effectiveness of our method by employing it in the structured pruning of the fully connected layers of shallow and deep neural networks where it outperforms the magnitude-based pruning approachesFile | Dimensione | Formato | |
---|---|---|---|
MWCAS2023_pruning.pdf
accesso aperto
Tipo:
Postprint
Licenza:
Licenza per accesso libero gratuito
Dimensione
782.74 kB
Formato
Adobe PDF
|
782.74 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.