CRIS Current Research Information System

Open information extraction approaches are useful but insufficient alone for populating the Web with machine readable information as their results are not directly linkable to, and immediately reusable from, other Linked Data sources. This work proposes a novel paradigm, named Open Knowledge Extraction, and its implementation (Legalo) that performs unsupervised, open domain, and abstractive knowledge extraction from text for producing machine readable information. The implemented method is based on the hypothesis that hyperlinks (either created by humans or knowledge extraction tools) provide a pragmatic trace of semantic relations between two entities, and that such semantic relations, their subjects and objects, can be revealed by processing their linguistic traces (i.e. the sentences that embed the hyperlinks) and formalised as Semantic Web triples and ontology axioms. Experimental evaluations conducted on validated text extracted from Wikipedia pages, with the help of crowdsourcing, confirm this hypothesis showing high performances. A demo is available at http://wit.istc.cnr.it/stlab-tools/legalo.

Presutti V, N.A. (2016). From hyperlinks to Semantic Web properties using Open Knowledge Extraction. SEMANTIC WEB, 7(4), 351-378 [10.3233/SW-160221].

From hyperlinks to Semantic Web properties using Open Knowledge Extraction

Presutti V;Nuzzolese AG;Consoli S;Gangemi A;Recupero DR

2016

Abstract

Open information extraction approaches are useful but insufficient alone for populating the Web with machine readable information as their results are not directly linkable to, and immediately reusable from, other Linked Data sources. This work proposes a novel paradigm, named Open Knowledge Extraction, and its implementation (Legalo) that performs unsupervised, open domain, and abstractive knowledge extraction from text for producing machine readable information. The implemented method is based on the hypothesis that hyperlinks (either created by humans or knowledge extraction tools) provide a pragmatic trace of semantic relations between two entities, and that such semantic relations, their subjects and objects, can be revealed by processing their linguistic traces (i.e. the sentences that embed the hyperlinks) and formalised as Semantic Web triples and ontology axioms. Experimental evaluations conducted on validated text extracted from Wikipedia pages, with the help of crowdsourcing, confirm this hypothesis showing high performances. A demo is available at http://wit.istc.cnr.it/stlab-tools/legalo.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2016
			
	Rivista
	
				SEMANTIC WEB
			
	Codice DOI
	
				https://dx.doi.org/10.3233/SW-160221
			
	Citazione
	
				Presutti V, N.A. (2016). From hyperlinks to Semantic Web properties using Open Knowledge Extraction. SEMANTIC WEB, 7(4), 351-378 [10.3233/SW-160221].
			
	Tutti gli autori
	
						Presutti V, Nuzzolese AG, Consoli S, Gangemi A, Recupero DR
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Presutti 2016_Semantic Web.pdf accesso aperto Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review Licenza: Licenza per accesso libero gratuito Dimensione 3.1 MB Formato Adobe PDF Visualizza/Apri	3.1 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/620521

Citazioni

ND

25

13

ND

social impact