Finding functional motifs in protein sequences with deep learning and natural language models

Savojardo C. (first author); Martelli P. L. (second author); Casadio R. (last author)
2023

Abstract

Recently, the prediction of structural and functional motifs in protein sequences has taken advantage of powerful machine learning-based approaches. Protein encoding now adopts protein language models, which surpass standard procedures. Different combinations of machine learning methods and encoding schemes are available for predicting different structural and functional motifs. Particularly interesting is the adoption of protein language models to encode proteins in addition to evolutionary information and physicochemical parameters. A thorough analysis of recent predictors developed for annotating transmembrane regions, sorting signals, lipidation sites, and phosphorylation sites allows us to investigate the state of the art, focusing on the relevance of protein language models for the different tasks. The analysis highlights that more experimental data are necessary to exploit the powerful machine learning methods now available.
Savojardo C., Martelli P.L., Casadio R. (2023). Finding functional motifs in protein sequences with deep learning and natural language models. CURRENT OPINION IN STRUCTURAL BIOLOGY, 81, 1-12 [10.1016/j.sbi.2023.102641].
Files in this record:
1-s2.0-S0959440X2300115X-main.pdf (open access)
Type: Publisher's version (PDF)
License: Open Access license. Creative Commons Attribution - NonCommercial - NoDerivatives (CC BY-NC-ND)
Size: 799.54 kB
Format: Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/11585/939937