CRIS Current Research Information System

Supervised models based on Transformers have been shown to achieve impressive performances in many natural language processing tasks. However, besides requiring a large amount of costly manually annotated data, supervised models tend to adapt to the characteristics of the training dataset, which are usually created ad-hoc and whose data distribution often differs from the one in real applications, showing significant performance degradation in real-world scenarios. We perform an extensive assessment of the out-of-distribution performances of supervised models for classification in the emotion and hate-speech detection tasks and show that NLI-based zero-shot models often outperform them, making task-specific annotation useless when the characteristics of final-user data are not known in advance. To benefit from both supervised and zero-shot approaches, we propose to fine-tune an NLI-based model on the task-specific dataset. The resulting model often outperforms all available supervised models both in distribution and out of distribution, with only a few thousand training samples.

Bulla, L., Gangemi, A., Mongiovi, M. (2023). Towards Distribution-shift Robust Text Classification of Emotional Content. Stroudsburg, PA 18360 : Association for Computational Linguistics (ACL) [10.18653/v1/2023.findings-acl.524].

Towards Distribution-shift Robust Text Classification of Emotional Content

Bulla L.;Gangemi A.;Mongiovi M.

2023

Abstract

Supervised models based on Transformers have been shown to achieve impressive performances in many natural language processing tasks. However, besides requiring a large amount of costly manually annotated data, supervised models tend to adapt to the characteristics of the training dataset, which are usually created ad-hoc and whose data distribution often differs from the one in real applications, showing significant performance degradation in real-world scenarios. We perform an extensive assessment of the out-of-distribution performances of supervised models for classification in the emotion and hate-speech detection tasks and show that NLI-based zero-shot models often outperform them, making task-specific annotation useless when the characteristics of final-user data are not known in advance. To benefit from both supervised and zero-shot approaches, we propose to fine-tune an NLI-based model on the task-specific dataset. The resulting model often outperforms all available supervised models both in distribution and out of distribution, with only a few thousand training samples.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Titolo del volume
	
				Findings of the Association for Computational Linguistics: ACL 2023
			
	Pagina iniziale
	
				8256
			
	Pagina finale
	
				8268
			
	Collana/Serie
	
				PROCEEDINGS OF THE CONFERENCE - ASSOCIATION FOR COMPUTATIONAL LINGUISTICS. MEETING
			
	Codice DOI
	
				https://dx.doi.org/10.18653/v1/2023.findings-acl.524
			
	Citazione
	
				Bulla, L., Gangemi, A., Mongiovi, M. (2023). Towards Distribution-shift Robust Text Classification of Emotional Content. Stroudsburg, PA 18360 : Association for Computational Linguistics (ACL) [10.18653/v1/2023.findings-acl.524].
			
	Tutti gli autori
	
						Bulla, L.; Gangemi, A.; Mongiovi, M.
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Towards Distribution-shift Robust Text Classification.pdf accesso aperto Descrizione: Contributo Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per accesso libero gratuito Dimensione 290.68 kB Formato Adobe PDF Visualizza/Apri	290.68 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/956837

Citazioni

ND

3

ND

social impact