For mapping football (soccer) player information by using multidimensional scaling, and for clustering football players, we construct a distance measure based on players’ performance data. The variables are of mixed type, but the main focus of this paper is how count variables are treated when defining a proper distance measure between players (e.g., top and lower level variables). The distance construction involves four steps: 1) representation , 2) transformation, 3) standardisation, 4) variable weighting. Several distance measures are discussed in terms of how well they match the interpretation of distance and similarity in the application of interest, with a focus on comparing Aitchison and Manhattan distance for variables giving percentage compositions. Preliminary outcomes of multidimensional scaling and clustering are shown.

Akhanli S, Hennig C (2017). Some Issues in Distance Construction for Football Players Performance Data. ARCHIVES OF DATA SCIENCE, SERIES A, 2, 1-17 [10.5445/KSP/1000058749/09].

Some Issues in Distance Construction for Football Players Performance Data

Hennig C
2017

Abstract

For mapping football (soccer) player information by using multidimensional scaling, and for clustering football players, we construct a distance measure based on players’ performance data. The variables are of mixed type, but the main focus of this paper is how count variables are treated when defining a proper distance measure between players (e.g., top and lower level variables). The distance construction involves four steps: 1) representation , 2) transformation, 3) standardisation, 4) variable weighting. Several distance measures are discussed in terms of how well they match the interpretation of distance and similarity in the application of interest, with a focus on comparing Aitchison and Manhattan distance for variables giving percentage compositions. Preliminary outcomes of multidimensional scaling and clustering are shown.
2017
Akhanli S, Hennig C (2017). Some Issues in Distance Construction for Football Players Performance Data. ARCHIVES OF DATA SCIENCE, SERIES A, 2, 1-17 [10.5445/KSP/1000058749/09].
Akhanli S; Hennig C
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/680799
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact