Cargando…
Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2
The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence s...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622901/ https://www.ncbi.nlm.nih.gov/pubmed/37878721 http://dx.doi.org/10.1073/pnas.2304302120 |
_version_ | 1785130641981964288 |
---|---|
author | Alderson, T. Reid Pritišanac, Iva Kolarić, Đesika Moses, Alan M. Forman-Kay, Julie D. |
author_facet | Alderson, T. Reid Pritišanac, Iva Kolarić, Đesika Moses, Alan M. Forman-Kay, Julie D. |
author_sort | Alderson, T. Reid |
collection | PubMed |
description | The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs. |
format | Online Article Text |
id | pubmed-10622901 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-106229012023-11-04 Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 Alderson, T. Reid Pritišanac, Iva Kolarić, Đesika Moses, Alan M. Forman-Kay, Julie D. Proc Natl Acad Sci U S A Biological Sciences The AlphaFold Protein Structure Database contains predicted structures for millions of proteins. For the majority of human proteins that contain intrinsically disordered regions (IDRs), which do not adopt a stable structure, it is generally assumed that these regions have low AlphaFold2 confidence scores that reflect low-confidence structural predictions. Here, we show that AlphaFold2 assigns confident structures to nearly 15% of human IDRs. By comparison to experimental NMR data for a subset of IDRs that are known to conditionally fold (i.e., upon binding or under other specific conditions), we find that AlphaFold2 often predicts the structure of the conditionally folded state. Based on databases of IDRs that are known to conditionally fold, we estimate that AlphaFold2 can identify conditionally folding IDRs at a precision as high as 88% at a 10% false positive rate, which is remarkable considering that conditionally folded IDR structures were minimally represented in its training data. We find that human disease mutations are nearly fivefold enriched in conditionally folded IDRs over IDRs in general and that up to 80% of IDRs in prokaryotes are predicted to conditionally fold, compared to less than 20% of eukaryotic IDRs. These results indicate that a large majority of IDRs in the proteomes of human and other eukaryotes function in the absence of conditional folding, but the regions that do acquire folds are more sensitive to mutations. We emphasize that the AlphaFold2 predictions do not reveal functionally relevant structural plasticity within IDRs and cannot offer realistic ensemble representations of conditionally folded IDRs. National Academy of Sciences 2023-10-25 2023-10-31 /pmc/articles/PMC10622901/ /pubmed/37878721 http://dx.doi.org/10.1073/pnas.2304302120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Biological Sciences Alderson, T. Reid Pritišanac, Iva Kolarić, Đesika Moses, Alan M. Forman-Kay, Julie D. Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 |
title | Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 |
title_full | Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 |
title_fullStr | Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 |
title_full_unstemmed | Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 |
title_short | Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2 |
title_sort | systematic identification of conditionally folded intrinsically disordered regions by alphafold2 |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622901/ https://www.ncbi.nlm.nih.gov/pubmed/37878721 http://dx.doi.org/10.1073/pnas.2304302120 |
work_keys_str_mv | AT aldersontreid systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2 AT pritisanaciva systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2 AT kolaricđesika systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2 AT mosesalanm systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2 AT formankayjulied systematicidentificationofconditionallyfoldedintrinsicallydisorderedregionsbyalphafold2 |