Most music production nowadays is carried out using software tools: for this reason, the market demands faithful audio effect simulations. Traditional methods for modeling nonlinear systems are effect-specific or labor-intensive; however, recent works yielded promising results by black-box simulation of these effects using neural networks. This work aims to explore two models of distortion effects based on autoencoders: one makes use of fully-connected layers only, and the other employs convolutional layers. Both models were trained using clean sounds as input and distorted sounds as target, thus, the learning method was not self-supervised, as it is mostly the case when dealing with autoencoders. The networks were then tested with visual inspection of the output spectrograms, as well as with an informal listening test, and performed well in reconstructing the distorted signal spectra, however a fair amount of noise was also introduced.

Modeling Audio Distortion Effects with Autoencoder Neural Networks / Russo R.; Bigoni F.; Palamas G.. - ELETTRONICO. - 377:(2021), pp. 131-141. (Intervento presentato al convegno 12th EAI International Conference on Intelligent Technologies for Interactive Entertainment, INTETAIN 2020 tenutosi a Santa Clara, United States nel 2020) [10.1007/978-3-030-76426-5_9].

Modeling Audio Distortion Effects with Autoencoder Neural Networks

Russo R.;
2021

Abstract

Most music production nowadays is carried out using software tools: for this reason, the market demands faithful audio effect simulations. Traditional methods for modeling nonlinear systems are effect-specific or labor-intensive; however, recent works yielded promising results by black-box simulation of these effects using neural networks. This work aims to explore two models of distortion effects based on autoencoders: one makes use of fully-connected layers only, and the other employs convolutional layers. Both models were trained using clean sounds as input and distorted sounds as target, thus, the learning method was not self-supervised, as it is mostly the case when dealing with autoencoders. The networks were then tested with visual inspection of the output spectrograms, as well as with an informal listening test, and performed well in reconstructing the distorted signal spectra, however a fair amount of noise was also introduced.
2021
Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
131
141
Modeling Audio Distortion Effects with Autoencoder Neural Networks / Russo R.; Bigoni F.; Palamas G.. - ELETTRONICO. - 377:(2021), pp. 131-141. (Intervento presentato al convegno 12th EAI International Conference on Intelligent Technologies for Interactive Entertainment, INTETAIN 2020 tenutosi a Santa Clara, United States nel 2020) [10.1007/978-3-030-76426-5_9].
Russo R.; Bigoni F.; Palamas G.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/857890
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact