Patient data collection is a routine procedure in many hospitals and clinical departments, typically demographics, administrative and clinical data; these data are characterized by different precision and accuracy, with mixed format which includes quantitative, qualitative and structured representation. This paper presents a system to allow an easy and reliable classification of mixed data as collected in a cardiology department. The method is based an the following steps: mixed data are mapped in a non Euclidean space according to the generalized Minkowski metric; a cost function is used to weight the transformed data and to compensate far their unequal precision; a clustering approach allows the finding of the number of clusters of a given data set; a cluster interpretation is obtained by graphic representation of results through a reduction of features by means of Principal Component Analysis. The system is user oriented, that means the user is free to select art automated procedure on. all the patients or to choose a subset of patients and/or features to drive the system at each step. The system is under evaluation by using a 7000 patients database, characterized by 126 mixed features.

Macerata, A., Del Corso, F., Michelassi, C., Varanini, M., Taddei, A., Marchesi, C. (1996). An interactive clustering procedure for selection of high dimension patterns. 345 E 47TH ST, NEW YORK, NY 10017 : I E E E [10.1109/CIC.1996.542550].

An interactive clustering procedure for selection of high dimension patterns

Del Corso, F.;
1996

Abstract

Patient data collection is a routine procedure in many hospitals and clinical departments, typically demographics, administrative and clinical data; these data are characterized by different precision and accuracy, with mixed format which includes quantitative, qualitative and structured representation. This paper presents a system to allow an easy and reliable classification of mixed data as collected in a cardiology department. The method is based an the following steps: mixed data are mapped in a non Euclidean space according to the generalized Minkowski metric; a cost function is used to weight the transformed data and to compensate far their unequal precision; a clustering approach allows the finding of the number of clusters of a given data set; a cluster interpretation is obtained by graphic representation of results through a reduction of features by means of Principal Component Analysis. The system is user oriented, that means the user is free to select art automated procedure on. all the patients or to choose a subset of patients and/or features to drive the system at each step. The system is under evaluation by using a 7000 patients database, characterized by 126 mixed features.
1996
COMPUTERS IN CARDIOLOGY
369
372
Macerata, A., Del Corso, F., Michelassi, C., Varanini, M., Taddei, A., Marchesi, C. (1996). An interactive clustering procedure for selection of high dimension patterns. 345 E 47TH ST, NEW YORK, NY 10017 : I E E E [10.1109/CIC.1996.542550].
Macerata, A.; Del Corso, F.; Michelassi, C.; Varanini, M.; Taddei, A.; Marchesi, C.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/797705
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 1
social impact