CRIS Current Research Information System

Disruptive situations are emotionally-charged events diverging from ordinary behavior, like people fighting or screaming. Public transports are one type of social environment where disruptive situation may occur, and their timely detection may bring significant improvements to people's safety. Current approaches to disruptive situation detection, typically based on CCTVs, do not take the emotional dimension into account. Conversely, we propose to frame such a problem as a speech emotion recognition task. To validate our hypotheses, we carry out an extensive experimental study focusing on the development of a model characterized by speaker/gender independence, robustness to noise, and robustness against multiple voices. We investigate a variety of audio features, classifiers, datasets, and data augmentation methods in an effort to define effective ways to address this under-investigated yet socially significant problem. Our experiments show that the proposed systems attain an F1 score of over 90% on the disruptive class, even when introducing noisy elements such as environmental noise or multiple overlapping voices. This robust performance is achieved with datasets characterized by speaker variability, gender diversity, and varying number of samples. Such promising results indicate that framing disruptive situation detection as a speech emotion recognition task could pave the way to the adoption of new types of intelligent systems with a positive impact on public safety.

Eleonora Mancini, Andrea Galassi, Federico Ruggeri, Paolo Torroni (2024). Disruptive situation detection on public transport through speech emotion recognition. INTELLIGENT SYSTEMS WITH APPLICATIONS, 21, 1-10 [10.1016/j.iswa.2023.200305].

Disruptive situation detection on public transport through speech emotion recognition

Eleonora Mancini^Primo;Andrea Galassi;Federico Ruggeri;Paolo Torroni

2024

Abstract

Disruptive situations are emotionally-charged events diverging from ordinary behavior, like people fighting or screaming. Public transports are one type of social environment where disruptive situation may occur, and their timely detection may bring significant improvements to people's safety. Current approaches to disruptive situation detection, typically based on CCTVs, do not take the emotional dimension into account. Conversely, we propose to frame such a problem as a speech emotion recognition task. To validate our hypotheses, we carry out an extensive experimental study focusing on the development of a model characterized by speaker/gender independence, robustness to noise, and robustness against multiple voices. We investigate a variety of audio features, classifiers, datasets, and data augmentation methods in an effort to define effective ways to address this under-investigated yet socially significant problem. Our experiments show that the proposed systems attain an F1 score of over 90% on the disruptive class, even when introducing noisy elements such as environmental noise or multiple overlapping voices. This robust performance is achieved with datasets characterized by speaker variability, gender diversity, and varying number of samples. Such promising results indicate that framing disruptive situation detection as a speech emotion recognition task could pave the way to the adoption of new types of intelligent systems with a positive impact on public safety.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Rivista
	
				INTELLIGENT SYSTEMS WITH APPLICATIONS
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.iswa.2023.200305
			
	Citazione
	
				Eleonora Mancini,  Andrea Galassi,  Federico Ruggeri,  Paolo Torroni (2024). Disruptive situation detection on public transport through speech emotion recognition. INTELLIGENT SYSTEMS WITH APPLICATIONS, 21, 1-10 [10.1016/j.iswa.2023.200305].
			
	Tutti gli autori
	
						Eleonora Mancini; Andrea Galassi; Federico Ruggeri; Paolo Torroni
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S2667305323001308-main.pdf accesso aperto Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 796.44 kB Formato Adobe PDF Visualizza/Apri	796.44 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/949865

Citazioni

ND

14

4

7

social impact