The Web of Data has grown explosively over the past few years, and as with any dataset, there are bound to be invalid statements in the data, as well as gaps. Natural Language Processing (NLP) is gaining interest to fill gaps in data by transforming (unstructured) text into structured data. However, there is currently a fundamental mismatch in approaches between Linked Data and NLP as the latter is often based on statistical methods, and the former on explicitly modelling knowledge. However, these fields can strengthen each other by joining forces. In this position paper, we argue that using linked data to validate the output of an NLP system, and using textual data to validate Linked Open Data (LOD) cloud statements is a promising research avenue. We illustrate our proposal with a proof of concept on a corpus of historical travel stories.

A proposal for a two-way journey on validating locations in unstructured and structured data / Keles I.; Qawasmeh O.; Tietz T.; Marinucci L.; Reda R.; van Erp M.. - ELETTRONICO. - 70:(2019), pp. 13.131-13.138. (Intervento presentato al convegno 2nd Conference on Language, Data and Knowledge, LDK 2019 tenutosi a Leipzig, Germany nel 2019) [10.4230/OASIcs.LDK.2019.13].

A proposal for a two-way journey on validating locations in unstructured and structured data

Reda R.;
2019

Abstract

The Web of Data has grown explosively over the past few years, and as with any dataset, there are bound to be invalid statements in the data, as well as gaps. Natural Language Processing (NLP) is gaining interest to fill gaps in data by transforming (unstructured) text into structured data. However, there is currently a fundamental mismatch in approaches between Linked Data and NLP as the latter is often based on statistical methods, and the former on explicitly modelling knowledge. However, these fields can strengthen each other by joining forces. In this position paper, we argue that using linked data to validate the output of an NLP system, and using textual data to validate Linked Open Data (LOD) cloud statements is a promising research avenue. We illustrate our proposal with a proof of concept on a corpus of historical travel stories.
2019
OpenAccess Series in Informatics
131
138
A proposal for a two-way journey on validating locations in unstructured and structured data / Keles I.; Qawasmeh O.; Tietz T.; Marinucci L.; Reda R.; van Erp M.. - ELETTRONICO. - 70:(2019), pp. 13.131-13.138. (Intervento presentato al convegno 2nd Conference on Language, Data and Knowledge, LDK 2019 tenutosi a Leipzig, Germany nel 2019) [10.4230/OASIcs.LDK.2019.13].
Keles I.; Qawasmeh O.; Tietz T.; Marinucci L.; Reda R.; van Erp M.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/735583
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact