Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are in- volved in many diseases. An understanding of in- trinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dy- namic. Computational methods that predict disor- der from the amino acid sequence have been pro- posed. Here, we present ADOPT (Attention DisOr- der PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic reso- nance chemical shifts, constructed to ensure bal- anced amounts of disordered and ordered residues, as a training and a test dataset for protein disor- der. ADOPT predicts whether a protein or a spe- cific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are rele- vant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone pack- age at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/.
Redl, I., Fisicaro, C., Dutton, O., Hoffmann, F., Henderson, L., Owens, B.M.J., et al. (2022). ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers. NAR GENOMICS AND BIOINFORMATICS, 5(2), 1-14 [10.1093/nargab/lqad041].
ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
Emanuele Paci;
2022
Abstract
Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are in- volved in many diseases. An understanding of in- trinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dy- namic. Computational methods that predict disor- der from the amino acid sequence have been pro- posed. Here, we present ADOPT (Attention DisOr- der PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic reso- nance chemical shifts, constructed to ensure bal- anced amounts of disordered and ordered residues, as a training and a test dataset for protein disor- der. ADOPT predicts whether a protein or a spe- cific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are rele- vant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone pack- age at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/.File | Dimensione | Formato | |
---|---|---|---|
redl22.pdf
accesso aperto
Tipo:
Versione (PDF) editoriale
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale (CCBYNC)
Dimensione
2.96 MB
Formato
Adobe PDF
|
2.96 MB | Adobe PDF | Visualizza/Apri |
lqad041_supplemental_file.pdf
accesso aperto
Tipo:
File Supplementare
Licenza:
Licenza per Accesso Aperto. Creative Commons Attribuzione - Non commerciale (CCBYNC)
Dimensione
436.11 kB
Formato
Adobe PDF
|
436.11 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.