A significant share of the world's textual information contains geographic references. User queries with geographic references are increasingly common, and users' expectations of search engines keep rising. Although several works have focused on this area, interpreting geographic information so as to better satisfy user needs remains a challenge. This work proposes techniques for identifying and analyzing geographic information in textual documents and natural-language queries. A geographic ontology, GeoNW, has been built by combining the GeoNames, WordNet and Wikipedia resources. Based on the information stored in GeoNW, geographic terms are identified and an algorithm for solving the toponym disambiguation problem is proposed. Once the geographic information is processed, we obtain a geographic ranking of documents, which is combined with a standard textual ranking to produce the final results. The GeoCLEF test collection is used to evaluate the accuracy of the results.
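
The abstract does not state how the geographic and textual rankings are combined. As an illustration only, the sketch below shows one common option, a weighted linear fusion of min-max normalized scores. The function names, the `geo_weight` parameter and the sample scores are hypothetical and are not taken from the paper.

```python
def min_max(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are equal, so every document gets the same weight.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def combine_rankings(text_scores, geo_scores, geo_weight=0.5):
    """Weighted linear fusion of textual and geographic scores.

    Assumption (not from the paper): a document missing from one
    list contributes 0 for that list.
    """
    text_n, geo_n = min_max(text_scores), min_max(geo_scores)
    docs = set(text_n) | set(geo_n)
    fused = {
        doc: (1 - geo_weight) * text_n.get(doc, 0.0)
        + geo_weight * geo_n.get(doc, 0.0)
        for doc in docs
    }
    # Sort by fused score (descending), breaking ties by document id.
    return sorted(fused.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical scores for four documents.
text = {"d1": 12.0, "d2": 9.0, "d3": 3.0}
geo = {"d2": 0.9, "d3": 0.8, "d4": 0.1}
for doc, score in combine_rankings(text, geo):
    print(doc, round(score, 3))
```

Setting `geo_weight=0` reduces the result to the textual ranking alone, which gives a simple baseline for judging whether the geographic evidence helps.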

Geographic information extraction, disambiguation and ranking techniques / Linares Zaila, Yisleidy; Montesi, Danilo. - Electronic. - 26-27-:(2015), pp. 2837695.1-2837695.7. (Paper presented at the 9th Workshop on Geographic Information Retrieval, GIR 2015, held in France in 2015) [10.1145/2837689.2837695].

Geographic information extraction, disambiguation and ranking techniques

Linares Zaila, Yisleidy; Montesi, Danilo
2015

Proceedings of the 9th Workshop on Geographic Information Retrieval, 2015, pp. 1-7


Use this identifier to cite or link to this document: https://hdl.handle.net/11585/548442