Motivation: Predicting the functional impact of a protein variation is one of the most challenging problems in Bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data allowing the application of rigorous statistical approaches for predicting if a given single point mutation has or not an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Results: Here we introduce a new method (SeqProfCod) to predict the likeliness that a given protein variant is associated or not with a human disease. We have developed a Support Vector Machine classifier trained using three sources of information: protein sequence, multiple protein sequence alignments and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 9,979 single point mutations from 1,599 human proteins from SWISS-PROT. It achieves 77% overall accuracy and a correlation coefficient of 0.49 indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Thus, this study demonstrates the synergic effect of combining the two classical sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and selective pressures at codon level.

Capriotti E., Arbiza L., Casadio R., Dopazo J., Dopazo H., Marti-Renom M.A. (2008). Selective pressure at the codon level improves the prediction of disease related protein mutations in human. VALENCIA : s.n.

Selective pressure at the codon level improves the prediction of disease related protein mutations in human

CAPRIOTTI, EMIDIO;CASADIO, RITA;
2008

Abstract

Motivation: Predicting the functional impact of a protein variation is one of the most challenging problems in Bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data allowing the application of rigorous statistical approaches for predicting if a given single point mutation has or not an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Results: Here we introduce a new method (SeqProfCod) to predict the likeliness that a given protein variant is associated or not with a human disease. We have developed a Support Vector Machine classifier trained using three sources of information: protein sequence, multiple protein sequence alignments and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 9,979 single point mutations from 1,599 human proteins from SWISS-PROT. It achieves 77% overall accuracy and a correlation coefficient of 0.49 indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Thus, this study demonstrates the synergic effect of combining the two classical sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and selective pressures at codon level.
2008
VIII Jornadas de Bioinformatica, Valencia
28
28
Capriotti E., Arbiza L., Casadio R., Dopazo J., Dopazo H., Marti-Renom M.A. (2008). Selective pressure at the codon level improves the prediction of disease related protein mutations in human. VALENCIA : s.n.
Capriotti E.; Arbiza L.; Casadio R.; Dopazo J.; Dopazo H.; Marti-Renom M.A.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/74005
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact