Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are in- volved in many diseases. An understanding of in- trinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dy- namic. Computational methods that predict disor- der from the amino acid sequence have been pro- posed. Here, we present ADOPT (Attention DisOr- der PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic reso- nance chemical shifts, constructed to ensure bal- anced amounts of disordered and ordered residues, as a training and a test dataset for protein disor- der. ADOPT predicts whether a protein or a spe- cific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are rele- vant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone pack- age at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/.

Redl, I., Fisicaro, C., Dutton, O., Hoffmann, F., Henderson, L., Owens, B.M.J., et al. (2022). ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers. NAR GENOMICS AND BIOINFORMATICS, 5(2), 1-14 [10.1093/nargab/lqad041].

ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers

Emanuele Paci;
2022

Abstract

Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are in- volved in many diseases. An understanding of in- trinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dy- namic. Computational methods that predict disor- der from the amino acid sequence have been pro- posed. Here, we present ADOPT (Attention DisOr- der PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic reso- nance chemical shifts, constructed to ensure bal- anced amounts of disordered and ordered residues, as a training and a test dataset for protein disor- der. ADOPT predicts whether a protein or a spe- cific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are rele- vant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone pack- age at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/.
2022
Redl, I., Fisicaro, C., Dutton, O., Hoffmann, F., Henderson, L., Owens, B.M.J., et al. (2022). ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers. NAR GENOMICS AND BIOINFORMATICS, 5(2), 1-14 [10.1093/nargab/lqad041].
Redl, Istvan; Fisicaro, Carlo; Dutton, Oliver; Hoffmann, Falk; Henderson, Louie; Owens, Benjamin M. J.; Heberling, Matthew; Paci, Emanuele; Tamiola, K...espandi
File in questo prodotto:
File Dimensione Formato  
redl22.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale (CCBYNC)
Dimensione 2.96 MB
Formato Adobe PDF
2.96 MB Adobe PDF Visualizza/Apri
lqad041_supplemental_file.pdf

accesso aperto

Tipo: File Supplementare
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale (CCBYNC)
Dimensione 436.11 kB
Formato Adobe PDF
436.11 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/967478
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 9
social impact