The last decade has seen an explosion in the collection of protein data. To actualized the potential offered by this wealth of data, it is important to develop machine systems capable of classifying and extracting features from proteins. Reliable machine systems for protein classification offers many benefits, including the promise of finding novel drugs and vaccines. In developing our system, we analyze and compare several feature extraction methods used in protein classification that are based on the calculation of texture descriptors starting from a wavelet representation of the protein. We then feed these texture-based representations of the protein into an Adaboost ensemble of neural network or a support vector machine classifier. In addition, we perform experiments that combine our feature extraction methods with a standard method that is based on the Chou's pseudo amino acid composition. Using several datasets, we show that our best approach outperforms standard methods. The Matlab code of the proposed protein descriptors is available at bias.csr.unibo.it/nanni/wave.rar.

L. Nanni, S. Brahnam, A. Lumini (2012). Wavelet images and Chou's pseudo amino acid composition for protein classification. AMINO ACIDS, 43(2), 657-665 [10.1007/s00726-011-1114-9].

Wavelet images and Chou's pseudo amino acid composition for protein classification

LUMINI, ALESSANDRA
2012

Abstract

The last decade has seen an explosion in the collection of protein data. To actualized the potential offered by this wealth of data, it is important to develop machine systems capable of classifying and extracting features from proteins. Reliable machine systems for protein classification offers many benefits, including the promise of finding novel drugs and vaccines. In developing our system, we analyze and compare several feature extraction methods used in protein classification that are based on the calculation of texture descriptors starting from a wavelet representation of the protein. We then feed these texture-based representations of the protein into an Adaboost ensemble of neural network or a support vector machine classifier. In addition, we perform experiments that combine our feature extraction methods with a standard method that is based on the Chou's pseudo amino acid composition. Using several datasets, we show that our best approach outperforms standard methods. The Matlab code of the proposed protein descriptors is available at bias.csr.unibo.it/nanni/wave.rar.
2012
L. Nanni, S. Brahnam, A. Lumini (2012). Wavelet images and Chou's pseudo amino acid composition for protein classification. AMINO ACIDS, 43(2), 657-665 [10.1007/s00726-011-1114-9].
L. Nanni; S. Brahnam; A. Lumini
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/133739
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 103
  • ???jsp.display-item.citation.isi??? 98
social impact