To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance, requiring that proposals of new methods are extensively and carefully compared with their best predecessors, and existing methods subjected to neutral comparison studies. Answers to benchmarking questions should be evidence-based, with the relevant evidence being collected through well-thought-out procedures, in reproducible and replicable ways. In the present paper, we review good research practices in benchmarking from the perspective of the area of cluster analysis. Discussion is given to the theoretical, conceptual underpinnings of benchmarking based on simulated and empirical data in this context. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made based on existing literature.This article is categorized under:Fundamental Concepts of Data and Knowledge > Data ConceptsFundamental Concepts of Data and Knowledge > Key Design Issues in Data MiningTechnologies > Structure Discovery and Clustering
Iven Van Mechelen, Anne-Laure Boulesteix, Rainer Dangl, Nema Dean, Christian Hennig, Friedrich Leisch, et al. (2023). A white paper on good research practices in benchmarking: The case of cluster analysis. WILEY INTERDISCIPLINARY REVIEWS. DATA MINING AND KNOWLEDGE DISCOVERY, 13(6 (November/December)), 1-20 [10.1002/widm.1511].
A white paper on good research practices in benchmarking: The case of cluster analysis
Iven Van Mechelen
Primo
;Christian Hennig;
2023
Abstract
To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance, requiring that proposals of new methods are extensively and carefully compared with their best predecessors, and existing methods subjected to neutral comparison studies. Answers to benchmarking questions should be evidence-based, with the relevant evidence being collected through well-thought-out procedures, in reproducible and replicable ways. In the present paper, we review good research practices in benchmarking from the perspective of the area of cluster analysis. Discussion is given to the theoretical, conceptual underpinnings of benchmarking based on simulated and empirical data in this context. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made based on existing literature.This article is categorized under:Fundamental Concepts of Data and Knowledge > Data ConceptsFundamental Concepts of Data and Knowledge > Key Design Issues in Data MiningTechnologies > Structure Discovery and ClusteringFile | Dimensione | Formato | |
---|---|---|---|
WIREs Data Min Knowl - 2023 - Van Mechelen - A white paper on good research practices in benchmarking The case of.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione
2.47 MB
Formato
Adobe PDF
|
2.47 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.