Cargando…

Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2

The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence s...

Descripción completa

Detalles Bibliográficos
Autores principales: Alderson, T. Reid, Pritišanac, Iva, Kolarić, Đesika, Moses, Alan M., Forman-Kay, Julie D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622901/
https://www.ncbi.nlm.nih.gov/pubmed/37878721
http://dx.doi.org/10.1073/pnas.2304302120
_version_ 1785130641981964288
author Alderson, T. Reid
Pritišanac, Iva
Kolarić, Đesika
Moses, Alan M.
Forman-Kay, Julie D.
author_facet Alderson, T. Reid
Pritišanac, Iva
Kolarić, Đesika
Moses, Alan M.
Forman-Kay, Julie D.
author_sort Alderson, T. Reid
collection PubMed
description The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs.
format Online
Article
Text
id pubmed-10622901
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-106229012023-11-04 Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 Alderson, T. Reid Pritišanac, Iva Kolarić, Đesika Moses, Alan M. Forman-Kay, Julie D. Proc Natl Acad Sci U S A Biological Sciences The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs. National Academy of Sciences 2023-10-25 2023-10-31 /pmc/articles/PMC10622901/ /pubmed/37878721 http://dx.doi.org/10.1073/pnas.2304302120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Alderson, T. Reid
Pritišanac, Iva
Kolarić, Đesika
Moses, Alan M.
Forman-Kay, Julie D.
Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
title Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
title_full Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
title_fullStr Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
title_full_unstemmed Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
title_short Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
title_sort systematic identification of conditionally folded intrinsically disordered regions by alphafold2
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622901/
https://www.ncbi.nlm.nih.gov/pubmed/37878721
http://dx.doi.org/10.1073/pnas.2304302120
work_keys_str_mv AT aldersontreid systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2
AT pritisanaciva systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2
AT kolaricđesika systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2
AT mosesalanm systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2
AT formankayjulied systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2