The paper presents CHerIDesCo (Cultural Heritage - Italian Description Corpus), a domain-specific linguistic resource designed for training and testing novel NLP tools in the Cultural Heritage field. The corpus was developed by the UNIOR NLP Research Group as part of SMACH, a three-year project funded by the National Operative Program to pursue the Smart Specialization Strategy defined by the EU. The project aims to improve language-based human-computer interaction in the Cultural Heritage domain by developing innovative applications, based on semantic language technologies, for multilingual access to content. In particular, the paper describes the design of the CHerIDesCo corpus, the annotation procedures, and the platforms to which the resource has been uploaded. As pointed out in the conclusion, this linguistic resource can be exploited in several NLP tasks (e.g., Named-Entity Recognition (NER), Named-Entity Linking (NEL), and Topic Modeling).

Risorse e applicazioni computazionali per l’accesso ai beni culturali: il Corpus CHerIDesCo / Gloria Gagliardi ; Massimo Guarino. - In: CHIMERA. - ISSN 2386-2629. - ELETTRONICO. - 8:(2021), pp. 25-43.

Risorse e applicazioni computazionali per l’accesso ai beni culturali: il Corpus CHerIDesCo.

Gloria Gagliardi (first author); 2021

Gloria Gagliardi ; Massimo Guarino
Files in this record:
Articolo in rivista.pdf (open access)

Description: Article
Type: Publisher's version (PDF)
License: Open Access License. Creative Commons Attribution (CC BY)
Size: 1.27 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/826424