For mapping football (soccer) player information by using multidimensional scaling, and for clustering football players, we construct a distance measure based on players’ performance data. The variables are of mixed type, but the main focus of this paper is how count variables are treated when defining a proper distance measure between players (e.g., top and lower level variables). The distance construction involves four steps: 1) representation , 2) transformation, 3) standardisation, 4) variable weighting. Several distance measures are discussed in terms of how well they match the interpretation of distance and similarity in the application of interest, with a focus on comparing Aitchison and Manhattan distance for variables giving percentage compositions. Preliminary outcomes of multidimensional scaling and clustering are shown.
Akhanli S, Hennig C (2017). Some Issues in Distance Construction for Football Players Performance Data. ARCHIVES OF DATA SCIENCE, SERIES A, 2, 1-17 [10.5445/KSP/1000058749/09].
Some Issues in Distance Construction for Football Players Performance Data
Hennig C
2017
Abstract
For mapping football (soccer) player information by using multidimensional scaling, and for clustering football players, we construct a distance measure based on players’ performance data. The variables are of mixed type, but the main focus of this paper is how count variables are treated when defining a proper distance measure between players (e.g., top and lower level variables). The distance construction involves four steps: 1) representation , 2) transformation, 3) standardisation, 4) variable weighting. Several distance measures are discussed in terms of how well they match the interpretation of distance and similarity in the application of interest, with a focus on comparing Aitchison and Manhattan distance for variables giving percentage compositions. Preliminary outcomes of multidimensional scaling and clustering are shown.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.