
rFBP: Replicated Focusing Belief Propagation algorithm

Curti, Nico (co-first author); Dall’Olio, Daniele (co-first author); Remondini, Daniel; Castellani, Gastone; Giampieri, Enrico (last author)
2020

Abstract

The rFBP project implements a scikit-learn compatible binary classifier based on fully connected neural networks. Its learning algorithm, Replicated Focusing Belief Propagation (rFBP), converges quickly and is robust (less prone to brittle overfitting) on ill-posed datasets, that is, datasets with very few samples compared to the number of features. The current implementation works only with binary features, such as a one-hot encoding of categorical data. The library has already been used successfully to predict source attribution from GWAS (Genome Wide Association Studies) data, in a study that aimed to predict the animal origin of an infectious bacterial disease within the H2020 European project COMPARE (Grant agreement ID: 643476). A full description of the pipeline used in that study is available in the abstract and slides provided in the publications folder of the project. Algorithm application on real data: "Classification of Genome Wide Association data by Belief Propagation Neural network", CCS Italy 2019 (conference paper and conference slides).
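The binary-feature requirement mentioned in the abstract can be illustrated with a minimal sketch of one-hot encoding. This is not the rFBP library's API: the function name `one_hot_encode` and the toy data are hypothetical, and the 0/1 value convention is an assumption (the library may expect a different binary encoding, such as -1/+1).

```python
from typing import List, Sequence, Tuple


def one_hot_encode(
    rows: Sequence[Sequence[str]],
) -> Tuple[List[List[int]], List[Tuple[int, str]]]:
    """One-hot encode categorical rows into 0/1 features (hypothetical helper).

    Returns the encoded matrix and the (column index, category) pair
    that each binary feature stands for.
    """
    n_columns = len(rows[0])
    # Collect the sorted set of categories observed in each column.
    categories = [sorted({row[j] for row in rows}) for j in range(n_columns)]
    # One binary feature per (column, category) pair.
    feature_names = [(j, c) for j in range(n_columns) for c in categories[j]]
    # A feature is 1 when the sample holds that category in that column.
    encoded = [
        [1 if row[j] == c else 0 for (j, c) in feature_names] for row in rows
    ]
    return encoded, feature_names


# Hypothetical toy data: 3 samples, 2 categorical features.
samples = [["A", "T"], ["G", "T"], ["A", "C"]]
X, names = one_hot_encode(samples)
```

Each categorical column expands into one binary feature per observed category, so the number of features grows while the number of samples stays fixed. This is one way a dataset ends up in the "few samples, many features" regime the abstract describes.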
Files in this item:
File: 11585_807617.pdf
Access: open access
Type: Publisher's version (PDF)
License: Open Access License. Creative Commons Attribution (CC BY)
Size: 129.52 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11585/807617