WordNet-like Lexical Databases (WLDs) group English words into synsets, being utilized in several text mining applications. Synsets were also open to criticism, because while synset members (wordsenses) are, in practice, considered as compeers, yet in theory not all of them represent the synset meaning with a same degree. Considering this criticism, fuzzy synsets (considering synsets as fuzzy sets) have been proposed. In this study, we show why the standard fuzzy synsets do not properly-enough model the membership uncertainty, and propose an upgraded version of them in which membership degrees are represented by intervals (similar to what in Interval Type 2 Fuzzy Sets). We present an algorithm for constructing the interval fuzzy version of WLDs of a language, given a large enough multicontextual corpus of documents and a precise enough word-sense-disambiguation (WSD) system of that language. Utilizing the algorithm, we produced interval fuzzy synsets of English WordNet (for the frequent-enough synsets). For evaluation, we compared the results with crowdsourced data, asking people to rate the min/max compatibility degree of wordsenses of a synset with its definition. Comparisons, promisingly, showed the algorithm accuracy. The algorithm has also the drawback of being applicable only for synsets with wordsenses having enough frequency in all the corpus categories. This drawback is going to be covered in our future work.

Towards Interval Version of Fuzzy Synsets / Hossayni SA, Rajati MR, Del Acebo E, Reforgiato Recupero, D, Gangemi A, Akbarzadeht MR, De La Rosa Esteva JL. - STAMPA. - (2016), pp. 297-302. (Intervento presentato al convegno 19th International Conference of the Catalan Association for Artificial Intelligence (CCIA 2016) tenutosi a Barcelona nel 19.10.2016 - 21.10.2016) [10.3233/978-1-61499-696-5-297].

Towards Interval Version of Fuzzy Synsets

GANGEMI, ALDO
Writing – Review & Editing
2016

Abstract

WordNet-like Lexical Databases (WLDs) group English words into synsets, being utilized in several text mining applications. Synsets were also open to criticism, because while synset members (wordsenses) are, in practice, considered as compeers, yet in theory not all of them represent the synset meaning with a same degree. Considering this criticism, fuzzy synsets (considering synsets as fuzzy sets) have been proposed. In this study, we show why the standard fuzzy synsets do not properly-enough model the membership uncertainty, and propose an upgraded version of them in which membership degrees are represented by intervals (similar to what in Interval Type 2 Fuzzy Sets). We present an algorithm for constructing the interval fuzzy version of WLDs of a language, given a large enough multicontextual corpus of documents and a precise enough word-sense-disambiguation (WSD) system of that language. Utilizing the algorithm, we produced interval fuzzy synsets of English WordNet (for the frequent-enough synsets). For evaluation, we compared the results with crowdsourced data, asking people to rate the min/max compatibility degree of wordsenses of a synset with its definition. Comparisons, promisingly, showed the algorithm accuracy. The algorithm has also the drawback of being applicable only for synsets with wordsenses having enough frequency in all the corpus categories. This drawback is going to be covered in our future work.
2016
19th International Conference of the Catalan Association for Artificial Intelligence, CCIA 2016
297
302
Towards Interval Version of Fuzzy Synsets / Hossayni SA, Rajati MR, Del Acebo E, Reforgiato Recupero, D, Gangemi A, Akbarzadeht MR, De La Rosa Esteva JL. - STAMPA. - (2016), pp. 297-302. (Intervento presentato al convegno 19th International Conference of the Catalan Association for Artificial Intelligence (CCIA 2016) tenutosi a Barcelona nel 19.10.2016 - 21.10.2016) [10.3233/978-1-61499-696-5-297].
Hossayni SA, Rajati MR, Del Acebo E, Reforgiato Recupero, D, Gangemi A, Akbarzadeht MR, De La Rosa Esteva JL
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/620603
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact