Misogyny is often expressed through figurative language. Some neutral words can assume a negative connotation when functioning as pejorative epithets. Disambiguating the meaning of such terms might help the detection of misogyny. To address this task, we present PejorativITy, a novel corpus of 1,200 manually annotated Italian tweets for pejorative language at the word level and misogyny at the sentence level. We evaluate the impact of injecting information about disambiguated words into a model targeting misogyny detection. In particular, we explore two different approaches for injection: concatenation of pejorative information and substitution of ambiguous words with univocal terms. Our experimental results, both on our corpus and on two popular benchmarks on Italian tweets, show that both approaches lead to a major classification improvement, indicating that word sense disambiguation is a promising preliminary step for misogyny detection. Furthermore, we investigate LLMs' understanding of pejorative epithets by means of contextual word embedding analysis and prompting.
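
The two injection approaches named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the input format, so the `Disambiguation` record, the function names, the `[SEP]` separator, and the label strings are all assumptions made for illustration. The example tweet and the replacement term are invented and are not taken from the corpus.

```python
import re
from dataclasses import dataclass


@dataclass
class Disambiguation:
    """Output of a hypothetical word-level disambiguation step."""
    word: str         # ambiguous term found in the tweet
    pejorative: bool  # True if the term is used as a pejorative epithet
    univocal: str     # unambiguous term standing in for the pejorative sense


def inject_by_concatenation(tweet: str, d: Disambiguation, sep: str = " [SEP] ") -> str:
    """Append the word-level label to the tweet (assumed format)."""
    label = "pejorative" if d.pejorative else "neutral"
    return f"{tweet}{sep}{d.word}: {label}"


def inject_by_substitution(tweet: str, d: Disambiguation) -> str:
    """Replace the ambiguous word with a univocal term, only if pejorative."""
    if not d.pejorative:
        return tweet
    pattern = re.compile(rf"\b{re.escape(d.word)}\b", flags=re.IGNORECASE)
    return pattern.sub(d.univocal, tweet)


# Invented example: "balena" (whale) used as an epithet.
tweet = "sei una balena"
d = Disambiguation(word="balena", pejorative=True, univocal="persona grassa")

concatenated = inject_by_concatenation(tweet, d)
substituted = inject_by_substitution(tweet, d)
```

In either case the transformed text, rather than the raw tweet, would be passed to the sentence-level misogyny classifier. When the word is judged neutral, the substitution variant leaves the tweet unchanged.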

Muti, A., Ruggeri, F., Toraman, C., Barrón-Cedeño, A., Algherini, S., Musetti, L., et al. (2024). PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets. ELRA and ICCL.

PejorativITy: Disambiguating Pejorative Epithets to Improve Misogyny Detection in Italian Tweets

Arianna Muti; Federico Ruggeri; Alberto Barrón-Cedeño
2024

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 12700-12711
Muti, Arianna; Ruggeri, Federico; Toraman, Cagri; Barrón-Cedeño, Alberto; Algherini, Samuel; Musetti, Lorenzo; Ronchi, Silvia; Saretto, Gianmarco; Zap...
Files in this item:

File: 2024.lrec-main.1112.pdf
Access: open access
Type: Publisher's version (PDF) / Version of Record
License: Open access license. Creative Commons Attribution - NonCommercial (CC BY-NC)
Size: 398.71 kB
Format: Adobe PDF

File: 2024.lrec-main.1112.OptionalSupplementaryMaterial.xlsx
Access: open access
Type: Supplementary file
License: Open access license. Creative Commons Attribution - NonCommercial (CC BY-NC)
Size: 127.96 kB
Format: Microsoft Excel XML

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/973179