This paper presents the E-MIMIC project, an application that aims to eliminate non-inclusive, prejudiced language forms from administrative texts written in European countries, starting with those written in Romance languages. It describes a methodology based on discourse criteria inspired by French discourse analysis, which is used to label a corpus of institutional documents; the labelled corpus is then used to train deep neural networks. Deep language modelling architectures are used to automatically identify non-inclusive text snippets, suggest alternative forms, and produce inclusive rephrasings. A preliminary evaluation on a benchmark dataset in Italian shows promising results and encourages us to finalise the application and to extend it to other languages, such as French.

Raus, R., Tonti, M., Cerquitelli, T., Cagliero, L., Attanasio, G., La Quatra, M., et al. (2022). L’analyse du discours et l’intelligence artificielle pour réaliser une écriture inclusive : le projet E-MIMIC. Les Ulis : EDP Sciences [10.1051/shsconf/202213801007].

L’analyse du discours et l’intelligence artificielle pour réaliser une écriture inclusive : le projet E-MIMIC

Rachele Raus; Michela Tonti
2022

8e Congrès Mondial de Linguistique Française (2022), pp. 1-15
Raus, Rachele; Tonti, Michela; Cerquitelli, Tania; Cagliero, Luca; Attanasio, Giuseppe; La Quatra, Moreno; Greco, Salvatore
Files in this item:
File: shsconf_cmlf2022_01007.pdf
Access: open access
Type: Publisher's version (PDF) / Version of Record
License: Open Access License. Creative Commons Attribution (CC BY)
Size: 3.1 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/889304