The separation and analysis of sound sources play a key role in assessing noise exposure in work environments. Sound level meter analyses do not allow technicians to quantify the noise's contribution of each sound source in a manifold context. This reveals a great lack of detail in plenty of cases. In office spaces, for example, different noise components can increase or decrease the employees' productivity. Machine and deep learning provide techniques to analyze large volumes of data. A previous study proposed two clustering techniques, the Gaussian Mixture Model and K-means, to analyze long-term measurements of the working activity within offices. In the present work, the daily activity of an office is analyzed through unsupervised clustering techniques and a semi-supervised analysis via a Variational autoencoder model. The first method exploits the sound pressure levels obtained by a sound level meter. The second method exploits the spectrograms obtained by the digital audio recording of the same activity, to obtain more detailed information about the sources. Results show good performance of both methods to identify and separate the traffic noise from the speech signal. The latent space of the Variational autoencoder can be a qualitative validation tool for the clustering methods.

de Salvio D., Bianco M.J., Gerstoft P., D'Orazio D., Garai M. (2022). VARIATIONAL AUTOENCODER AND CLUSTERING TECHNIQUES TO SEGREGATE SOUND SOURCES. Gyeongju : Acoustical Society of Korea.

VARIATIONAL AUTOENCODER AND CLUSTERING TECHNIQUES TO SEGREGATE SOUND SOURCES

de Salvio D.
;
D'Orazio D.;Garai M.
2022

Abstract

The separation and analysis of sound sources play a key role in assessing noise exposure in work environments. Sound level meter analyses do not allow technicians to quantify the noise's contribution of each sound source in a manifold context. This reveals a great lack of detail in plenty of cases. In office spaces, for example, different noise components can increase or decrease the employees' productivity. Machine and deep learning provide techniques to analyze large volumes of data. A previous study proposed two clustering techniques, the Gaussian Mixture Model and K-means, to analyze long-term measurements of the working activity within offices. In the present work, the daily activity of an office is analyzed through unsupervised clustering techniques and a semi-supervised analysis via a Variational autoencoder model. The first method exploits the sound pressure levels obtained by a sound level meter. The second method exploits the spectrograms obtained by the digital audio recording of the same activity, to obtain more detailed information about the sources. Results show good performance of both methods to identify and separate the traffic noise from the speech signal. The latent space of the Variational autoencoder can be a qualitative validation tool for the clustering methods.
2022
Proceedings of the 24th International Congress on Acoustics ICA 2022
1
5
de Salvio D., Bianco M.J., Gerstoft P., D'Orazio D., Garai M. (2022). VARIATIONAL AUTOENCODER AND CLUSTERING TECHNIQUES TO SEGREGATE SOUND SOURCES. Gyeongju : Acoustical Society of Korea.
de Salvio D.; Bianco M.J.; Gerstoft P.; D'Orazio D.; Garai M.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/970414
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact