Supervised models based on Transformers have been shown to achieve impressive performance in many natural language processing tasks. However, besides requiring large amounts of costly manually annotated data, supervised models tend to adapt to the characteristics of their training datasets, which are usually created ad hoc and whose data distribution often differs from that of real applications; as a result, these models show significant performance degradation in real-world scenarios. We perform an extensive assessment of the out-of-distribution performance of supervised models for classification in the emotion and hate-speech detection tasks and show that NLI-based zero-shot models often outperform them, making task-specific annotation useless when the characteristics of final-user data are not known in advance. To benefit from both supervised and zero-shot approaches, we propose fine-tuning an NLI-based model on the task-specific dataset. The resulting model often outperforms all available supervised models both in distribution and out of distribution, with only a few thousand training samples.
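
As a rough illustration of how a classification task can be cast as NLI, the sketch below turns each candidate label into a hypothesis sentence and picks the label whose hypothesis is most entailed by the input text. It is not the authors' implementation: the hypothesis template, the function names, and the toy scorer are all assumptions made here so that the example runs without any model.

```python
from typing import Callable, Dict, List

# Hypothetical template: the abstract does not state the wording used in the paper.
TEMPLATE = "This text expresses {label}."


def build_hypotheses(labels: List[str], template: str = TEMPLATE) -> Dict[str, str]:
    """Map each candidate label to an NLI hypothesis sentence."""
    return {label: template.format(label=label) for label in labels}


def zero_shot_classify(
    text: str,
    labels: List[str],
    entailment_score: Callable[[str, str], float],
    template: str = TEMPLATE,
) -> str:
    """Return the label whose hypothesis scores highest with `text` as premise."""
    hypotheses = build_hypotheses(labels, template)
    scores = {label: entailment_score(text, hyp) for label, hyp in hypotheses.items()}
    return max(scores, key=scores.get)


def toy_entailment_score(premise: str, hypothesis: str) -> float:
    """Stand-in scorer (NOT a real NLI model): share of hypothesis words in the premise."""
    premise_words = set(premise.lower().replace(".", "").split())
    hypothesis_words = hypothesis.lower().replace(".", "").split()
    return sum(w in premise_words for w in hypothesis_words) / len(hypothesis_words)


if __name__ == "__main__":
    labels = ["joy", "anger", "fear"]
    text = "I am shaking with anger right now."
    print(zero_shot_classify(text, labels, toy_entailment_score))
```

In the setting the abstract describes, `entailment_score` would be supplied by a pretrained NLI model, and the proposed fine-tuning would adapt that model on the task-specific dataset; the word-overlap scorer here only keeps the sketch self-contained.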

Bulla L., Gangemi A., Mongiovi M. (2023). Towards Distribution-shift Robust Text Classification of Emotional Content. Stroudsburg, PA 18360 : Association for Computational Linguistics (ACL) [10.18653/v1/2023.findings-acl.524].

Towards Distribution-shift Robust Text Classification of Emotional Content

Gangemi A.; 2023

Year: 2023
Published in: Proceedings of the Annual Meeting of the Association for Computational Linguistics
Pages: 8256-8268
Bulla L.; Gangemi A.; Mongiovi M.
Files in this item:
Towards Distribution-shift Robust Text Classification.pdf

open access

Description: Contribution
Type: Publisher's version (PDF)
License: Free open-access license
Size: 290.68 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/956837
Warning: the data shown have not been validated by the university.
