CRIS Current Research Information System

Various disconnected chord datasets are currently available for music analysis and information retrieval, but they are often limited by either their size, non-openness, lack of timed information, and interoperability. Together with the lack of overlapping repertoire coverage, this limits cross-corpus studies on harmony over time and across genres, and hampers research in computational music analysis (chord recognition, pattern mining, computational creativity), which needs access to large datasets. We contribute to address this gap, by releasing the Chord Corpus (ChoCo), a large-scale dataset that semantically integrates harmonic data from 18 different sources using heterogeneous representations and formats (Harte, Leadsheet, Roman numerals, ABC, etc.). We rely on JAMS (JSON Annotated Music Specification), a popular data structure for annotations in Music Information Retrieval, to represent and enrich chord-related information (chord, key, mode, etc.) in a uniform way. To achieve semantic integration, we design a novel ontology for modelling music annotations and the entities they involve (artists, scores, etc.), and we build a 30M-triple knowledge graph, including 4 K+ links to other datasets (MIDI-LD, LED).

de Berardinis J., Meroño-Peñuela A., Poltronieri A., Presutti V. (2023). ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs. SCIENTIFIC DATA, 10, 1-25 [10.1038/s41597-023-02410-w].

ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs

Poltronieri A.^{Co-primo

Methodology};Presutti V.^{Co-primo

Project Administration}

2023

Abstract

Various disconnected chord datasets are currently available for music analysis and information retrieval, but they are often limited by either their size, non-openness, lack of timed information, and interoperability. Together with the lack of overlapping repertoire coverage, this limits cross-corpus studies on harmony over time and across genres, and hampers research in computational music analysis (chord recognition, pattern mining, computational creativity), which needs access to large datasets. We contribute to address this gap, by releasing the Chord Corpus (ChoCo), a large-scale dataset that semantically integrates harmonic data from 18 different sources using heterogeneous representations and formats (Harte, Leadsheet, Roman numerals, ABC, etc.). We rely on JAMS (JSON Annotated Music Specification), a popular data structure for annotations in Music Information Retrieval, to represent and enrich chord-related information (chord, key, mode, etc.) in a uniform way. To achieve semantic integration, we design a novel ontology for modelling music annotations and the entities they involve (artists, scores, etc.), and we build a 30M-triple knowledge graph, including 4 K+ links to other datasets (MIDI-LD, LED).

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Rivista
	
				SCIENTIFIC DATA
			
	Codice DOI
	
				https://dx.doi.org/10.1038/s41597-023-02410-w
			
	Citazione
	
				de Berardinis J.,  Meroño-Peñuela A.,  Poltronieri A.,  Presutti V. (2023). ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs. SCIENTIFIC DATA, 10, 1-25 [10.1038/s41597-023-02410-w].
			
	Tutti gli autori
	
						de Berardinis J.; Meroño-Peñuela A.; Poltronieri A.; Presutti V.
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
s41597-023-02410-w.pdf accesso aperto Descrizione: ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs Tipo: Versione (PDF) editoriale / Version Of Record Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 3.04 MB Formato Adobe PDF Visualizza/Apri	3.04 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/969603

Citazioni

1

17

7

16

social impact