To address the classification problem when the number of cases is too small to effectively use just a single technique, this paper suggests to use a “hybrid” approach, that combines tree classifiers, expert knowledge and Bayesian networks. The output of the “hybrid” strategy takes the form of a Bayesian network, where the serious drawback of requiring huge amounts of data was overcome by coupling the network with another classifier and using expert knowledge. The technique was applied to two clinical case-studies and its predictive performance was compared with the performances of the single approaches. The results show that the proposed technique benefits from several advantages of the three single approaches and outperforms all of them: it shows a better sensitivity, that plays a crucial role in classifying new patients into the high-risk category, and is competitive with regards to specificity. Therefore, even though additional studies are needed to validate the “hybrid” approach, it seems a promising technique to develop reliable classification systems for small datasets.

Learning Bayesian Networks from Classification trees and Expert knowledge: a preliminary study

STRACQUALURSI, LUISA;AGATI, PATRIZIA
2015

Abstract

To address the classification problem when the number of cases is too small to effectively use just a single technique, this paper suggests to use a “hybrid” approach, that combines tree classifiers, expert knowledge and Bayesian networks. The output of the “hybrid” strategy takes the form of a Bayesian network, where the serious drawback of requiring huge amounts of data was overcome by coupling the network with another classifier and using expert knowledge. The technique was applied to two clinical case-studies and its predictive performance was compared with the performances of the single approaches. The results show that the proposed technique benefits from several advantages of the three single approaches and outperforms all of them: it shows a better sensitivity, that plays a crucial role in classifying new patients into the high-risk category, and is competitive with regards to specificity. Therefore, even though additional studies are needed to validate the “hybrid” approach, it seems a promising technique to develop reliable classification systems for small datasets.
2015
ALL IN ONE CONFERENCES - Electronic Book of Proceedings
92
95
Stracqualursi L.; Agati P.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/508577
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact