Peroni, S., Rizzetto, E. (2025). A Tool for Validating and Monitoring Bibliographic Data in Open Research Information Systems: the OpenCitations Collections.
A Tool for Validating and Monitoring Bibliographic Data in Open Research Information Systems: the OpenCitations Collections
Peroni S.; Rizzetto E.
2025
Abstract
This paper presents a methodology and software for ensuring data quality in open scholarly bibliographic collections. Taking as a case study OpenCitations Meta and OpenCitations Index, which store bibliographic metadata and citations respectively, it introduces two tools: a data validator and a data monitor. The validator checks the syntactic and semantic correctness of bibliographic data before ingestion, providing both machine-readable reports and user-friendly feedback. The monitor tracks known data issues after ingestion using SPARQL queries, supporting ongoing data integrity. Designed with accessibility in mind, both tools facilitate automated workflows and user interaction.



