Cargando…

ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers

Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are hi...

Descripción completa

Detalles Bibliográficos
Autores principales: Redl, Istvan, Fisicaro, Carlo, Dutton, Oliver, Hoffmann, Falk, Henderson, Louie, Owens, Benjamin M J, Heberling, Matthew, Paci, Emanuele, Tamiola, Kamil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10150328/
https://www.ncbi.nlm.nih.gov/pubmed/37138579
http://dx.doi.org/10.1093/nargab/lqad041
_version_ 1785035340350750720
author Redl, Istvan
Fisicaro, Carlo
Dutton, Oliver
Hoffmann, Falk
Henderson, Louie
Owens, Benjamin M J
Heberling, Matthew
Paci, Emanuele
Tamiola, Kamil
author_facet Redl, Istvan
Fisicaro, Carlo
Dutton, Oliver
Hoffmann, Falk
Henderson, Louie
Owens, Benjamin M J
Heberling, Matthew
Paci, Emanuele
Tamiola, Kamil
author_sort Redl, Istvan
collection PubMed
description Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dynamic. Computational methods that predict disorder from the amino acid sequence have been proposed. Here, we present ADOPT (Attention DisOrder PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic resonance chemical shifts, constructed to ensure balanced amounts of disordered and ordered residues, as a training and a test dataset for protein disorder. ADOPT predicts whether a protein or a specific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are relevant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone package at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/.
format Online
Article
Text
id pubmed-10150328
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101503282023-05-02 ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers Redl, Istvan Fisicaro, Carlo Dutton, Oliver Hoffmann, Falk Henderson, Louie Owens, Benjamin M J Heberling, Matthew Paci, Emanuele Tamiola, Kamil NAR Genom Bioinform Standard Article Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dynamic. Computational methods that predict disorder from the amino acid sequence have been proposed. Here, we present ADOPT (Attention DisOrder PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic resonance chemical shifts, constructed to ensure balanced amounts of disordered and ordered residues, as a training and a test dataset for protein disorder. ADOPT predicts whether a protein or a specific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are relevant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone package at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/. Oxford University Press 2023-05-01 /pmc/articles/PMC10150328/ /pubmed/37138579 http://dx.doi.org/10.1093/nargab/lqad041 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Redl, Istvan
Fisicaro, Carlo
Dutton, Oliver
Hoffmann, Falk
Henderson, Louie
Owens, Benjamin M J
Heberling, Matthew
Paci, Emanuele
Tamiola, Kamil
ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
title ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
title_full ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
title_fullStr ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
title_full_unstemmed ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
title_short ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
title_sort adopt: intrinsic protein disorder prediction through deep bidirectional transformers
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10150328/
https://www.ncbi.nlm.nih.gov/pubmed/37138579
http://dx.doi.org/10.1093/nargab/lqad041
work_keys_str_mv AT redlistvan adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT fisicarocarlo adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT duttonoliver adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT hoffmannfalk adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT hendersonlouie adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT owensbenjaminmj adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT heberlingmatthew adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT paciemanuele adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers
AT tamiolakamil adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers