Given a membrane protein it is very important to know which membrane type it belongs to because this kind of information can provide clues for better understanding its function. In this work, we propose a system for predicting the membrane proteins type directly from the amino acid sequence, the feature extraction step is based on an encoding technique that combines the physicochemical amino-acid properties with the residue couple model. The residue couple model is a method inspired by Chou’s quasi-sequence-order model that extracts the features utilizing the sequence order effect indirectly. A set of support vector machines, each trained using a different physicochemical amino-acid property combined with the residue couple model, are combined by vote rule. The success rate obtained by our system on a difficult dataset, where the sequences in a given membrane type have a low sequence identity to any other proteins of the same membrane type, are quite high indicating that the proposed method, where the features are extracted directly from the amino acid sequence, is a feasible system for predicting the membrane proteins type. Keywords: DOI: 10.1007/s00726-008-0083-0

Nanni, L., Lumini, A. (2008). An ensemble of Support Vector Machines for predicting the membrane proteins type directly from the amino acid sequence. AMINO ACIDS, 35, 573-580 [10.1007/s00726-008-0083-0].

An ensemble of Support Vector Machines for predicting the membrane proteins type directly from the amino acid sequence

NANNI, LORIS;LUMINI, ALESSANDRA
2008

Abstract

Given a membrane protein it is very important to know which membrane type it belongs to because this kind of information can provide clues for better understanding its function. In this work, we propose a system for predicting the membrane proteins type directly from the amino acid sequence, the feature extraction step is based on an encoding technique that combines the physicochemical amino-acid properties with the residue couple model. The residue couple model is a method inspired by Chou’s quasi-sequence-order model that extracts the features utilizing the sequence order effect indirectly. A set of support vector machines, each trained using a different physicochemical amino-acid property combined with the residue couple model, are combined by vote rule. The success rate obtained by our system on a difficult dataset, where the sequences in a given membrane type have a low sequence identity to any other proteins of the same membrane type, are quite high indicating that the proposed method, where the features are extracted directly from the amino acid sequence, is a feasible system for predicting the membrane proteins type. Keywords: DOI: 10.1007/s00726-008-0083-0
2008
Nanni, L., Lumini, A. (2008). An ensemble of Support Vector Machines for predicting the membrane proteins type directly from the amino acid sequence. AMINO ACIDS, 35, 573-580 [10.1007/s00726-008-0083-0].
Nanni, Loris; Lumini, Alessandra
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/63188
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 38
  • ???jsp.display-item.citation.isi??? 35
social impact