The object-oriented metrics suite proposed by Chidamber and Kemerer (CK) is a measurement 17 approach towards improved object-oriented design and development practices. However, existing studies 18 evidence traces of collinearity between some of the metrics and low ranges of other metrics, two facts which 19 may endanger the validity of models based on the CK suite. As high correlation may be an indicator of 20 collinearity, in this paper, we empirically determine to what extent high correlations and low ranges might be 21 expected among CK metrics. 22 To draw as much general conclusions as possible, we extract the CK metrics from a large data set (200 23 public domain projects) and we apply statistical meta-analysis techniques to strengthen the validity of our 24 results. Homogeneously through the projects, we found a moderate (~0.50) to high correlation (>0.80) between 25 some of the metrics and low ranges of other metrics. 26 Results of this empirical analysis supply researchers and practitioners with three main advises: a) to avoid the 27 use in prediction systems of CK metrics that have correlation more than 0.80 b) to test for collinearity those 28 metrics that present moderate correlations (between 0.50 and 0.60) c) to avoid the use as response in continuous 29 parametric regression analysis of the metrics presenting low variance. This might therefore suggest that a 30 prediction system may not be based on the whole CK metrics suite, but only on a subset consisting of those 31 metrics that do not present either high correlation or low ranges.
Succi G, Pedrycz W, Djokic S, Russo B, Zuliani P (2005). An Empirical Exploration of the Distributions of the Chidamber and Kemerer Object-Oriented Metrics Suite. EMPIRICAL SOFTWARE ENGINEERING, 10 (1), 81-104.
An Empirical Exploration of the Distributions of the Chidamber and Kemerer Object-Oriented Metrics Suite
Succi G;
2005
Abstract
The object-oriented metrics suite proposed by Chidamber and Kemerer (CK) is a measurement 17 approach towards improved object-oriented design and development practices. However, existing studies 18 evidence traces of collinearity between some of the metrics and low ranges of other metrics, two facts which 19 may endanger the validity of models based on the CK suite. As high correlation may be an indicator of 20 collinearity, in this paper, we empirically determine to what extent high correlations and low ranges might be 21 expected among CK metrics. 22 To draw as much general conclusions as possible, we extract the CK metrics from a large data set (200 23 public domain projects) and we apply statistical meta-analysis techniques to strengthen the validity of our 24 results. Homogeneously through the projects, we found a moderate (~0.50) to high correlation (>0.80) between 25 some of the metrics and low ranges of other metrics. 26 Results of this empirical analysis supply researchers and practitioners with three main advises: a) to avoid the 27 use in prediction systems of CK metrics that have correlation more than 0.80 b) to test for collinearity those 28 metrics that present moderate correlations (between 0.50 and 0.60) c) to avoid the use as response in continuous 29 parametric regression analysis of the metrics presenting low variance. This might therefore suggest that a 30 prediction system may not be based on the whole CK metrics suite, but only on a subset consisting of those 31 metrics that do not present either high correlation or low ranges.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.