Cargando…
ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers
Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are hi...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10150328/ https://www.ncbi.nlm.nih.gov/pubmed/37138579 http://dx.doi.org/10.1093/nargab/lqad041 |
_version_ | 1785035340350750720 |
---|---|
author | Redl, Istvan Fisicaro, Carlo Dutton, Oliver Hoffmann, Falk Henderson, Louie Owens, Benjamin M J Heberling, Matthew Paci, Emanuele Tamiola, Kamil |
author_facet | Redl, Istvan Fisicaro, Carlo Dutton, Oliver Hoffmann, Falk Henderson, Louie Owens, Benjamin M J Heberling, Matthew Paci, Emanuele Tamiola, Kamil |
author_sort | Redl, Istvan |
collection | PubMed |
description | Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dynamic. Computational methods that predict disorder from the amino acid sequence have been proposed. Here, we present ADOPT (Attention DisOrder PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic resonance chemical shifts, constructed to ensure balanced amounts of disordered and ordered residues, as a training and a test dataset for protein disorder. ADOPT predicts whether a protein or a specific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are relevant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone package at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/. |
format | Online Article Text |
id | pubmed-10150328 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101503282023-05-02 ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers Redl, Istvan Fisicaro, Carlo Dutton, Oliver Hoffmann, Falk Henderson, Louie Owens, Benjamin M J Heberling, Matthew Paci, Emanuele Tamiola, Kamil NAR Genom Bioinform Standard Article Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dynamic. Computational methods that predict disorder from the amino acid sequence have been proposed. Here, we present ADOPT (Attention DisOrder PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook’s Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic resonance chemical shifts, constructed to ensure balanced amounts of disordered and ordered residues, as a training and a test dataset for protein disorder. ADOPT predicts whether a protein or a specific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are relevant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone package at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/. Oxford University Press 2023-05-01 /pmc/articles/PMC10150328/ /pubmed/37138579 http://dx.doi.org/10.1093/nargab/lqad041 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Redl, Istvan Fisicaro, Carlo Dutton, Oliver Hoffmann, Falk Henderson, Louie Owens, Benjamin M J Heberling, Matthew Paci, Emanuele Tamiola, Kamil ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers |
title | ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers |
title_full | ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers |
title_fullStr | ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers |
title_full_unstemmed | ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers |
title_short | ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers |
title_sort | adopt: intrinsic protein disorder prediction through deep bidirectional transformers |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10150328/ https://www.ncbi.nlm.nih.gov/pubmed/37138579 http://dx.doi.org/10.1093/nargab/lqad041 |
work_keys_str_mv | AT redlistvan adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT fisicarocarlo adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT duttonoliver adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT hoffmannfalk adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT hendersonlouie adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT owensbenjaminmj adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT heberlingmatthew adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT paciemanuele adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers AT tamiolakamil adoptintrinsicproteindisorderpredictionthroughdeepbidirectionaltransformers |