To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance, requiring that proposals of new methods are extensively and carefully compared with their best predecessors, and existing methods subjected to neutral comparison studies. Answers to benchmarking questions should be evidence-based, with the relevant evidence being collected through well-thought-out procedures, in reproducible and replicable ways. In the present paper, we review good research practices in benchmarking from the perspective of the area of cluster analysis. Discussion is given to the theoretical, conceptual underpinnings of benchmarking based on simulated and empirical data in this context. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made based on existing literature.This article is categorized under:Fundamental Concepts of Data and Knowledge > Data ConceptsFundamental Concepts of Data and Knowledge > Key Design Issues in Data MiningTechnologies > Structure Discovery and Clustering

Iven Van Mechelen, Anne-Laure Boulesteix, Rainer Dangl, Nema Dean, Christian Hennig, Friedrich Leisch, et al. (2023). A white paper on good research practices in benchmarking: The case of cluster analysis. WILEY INTERDISCIPLINARY REVIEWS. DATA MINING AND KNOWLEDGE DISCOVERY, 13(6 (November/December)), 1-20 [10.1002/widm.1511].

A white paper on good research practices in benchmarking: The case of cluster analysis

Iven Van Mechelen
Primo
;
Christian Hennig;
2023

Abstract

To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance, requiring that proposals of new methods are extensively and carefully compared with their best predecessors, and existing methods subjected to neutral comparison studies. Answers to benchmarking questions should be evidence-based, with the relevant evidence being collected through well-thought-out procedures, in reproducible and replicable ways. In the present paper, we review good research practices in benchmarking from the perspective of the area of cluster analysis. Discussion is given to the theoretical, conceptual underpinnings of benchmarking based on simulated and empirical data in this context. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made based on existing literature.This article is categorized under:Fundamental Concepts of Data and Knowledge > Data ConceptsFundamental Concepts of Data and Knowledge > Key Design Issues in Data MiningTechnologies > Structure Discovery and Clustering
2023
Iven Van Mechelen, Anne-Laure Boulesteix, Rainer Dangl, Nema Dean, Christian Hennig, Friedrich Leisch, et al. (2023). A white paper on good research practices in benchmarking: The case of cluster analysis. WILEY INTERDISCIPLINARY REVIEWS. DATA MINING AND KNOWLEDGE DISCOVERY, 13(6 (November/December)), 1-20 [10.1002/widm.1511].
Iven Van Mechelen; Anne-Laure Boulesteix; Rainer Dangl; Nema Dean; Christian Hennig; Friedrich Leisch; Douglas Steinley; Matthijs J. Warrens
File in questo prodotto:
File Dimensione Formato  
WIREs Data Min Knowl - 2023 - Van Mechelen - A white paper on good research practices in benchmarking The case of.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 2.47 MB
Formato Adobe PDF
2.47 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/949307
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 6
social impact