Cargando…
Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., if a protein can really fold into the predicted 3D structure. We wondered if the computational models f...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421090/ https://www.ncbi.nlm.nih.gov/pubmed/36051869 http://dx.doi.org/10.1016/j.csbj.2022.08.034 |
_version_ | 1784777518470922240 |
---|---|
author | Liu, Sen Wu, Kan Chen, Cheng |
author_facet | Liu, Sen Wu, Kan Chen, Cheng |
author_sort | Liu, Sen |
collection | PubMed |
description | The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., if a protein can really fold into the predicted 3D structure. We wondered if the computational models from AlphaFold2 and RoseTTAFold might carry protein foldability information. Therefore, we predicted the structural models of 159 circular permutants and 158 alanine insertion mutants of the 159-residue dihydrofolate reductase. Our data showed that although AlphaFold2 and RoseTTAFold cannot directly identify unfoldable proteins, the RMSD values of computational models are correlated with protein foldability, with higher RMSD values indicating lower protein foldability. Furthermore, this correlation is independent of secondary structures, and the RMSD values of computational models are quantitatively correlated with protein foldability but not protein functions. Additionally, using a dataset of 129 de novo designed proteins, we showed that inter-model RMSD values between AlphaFold2 models and RoseTTAFold models are a good indicator of protein foldability. At last, we showed that inter-model RMSD values are also useful for evaluating protein solubility by modeling 1664 natural proteins. Our work could be of great value to the design of novel proteins and the prediction of protein foldability. |
format | Online Article Text |
id | pubmed-9421090 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-94210902022-08-31 Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold Liu, Sen Wu, Kan Chen, Cheng Comput Struct Biotechnol J Research Article The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., if a protein can really fold into the predicted 3D structure. We wondered if the computational models from AlphaFold2 and RoseTTAFold might carry protein foldability information. Therefore, we predicted the structural models of 159 circular permutants and 158 alanine insertion mutants of the 159-residue dihydrofolate reductase. Our data showed that although AlphaFold2 and RoseTTAFold cannot directly identify unfoldable proteins, the RMSD values of computational models are correlated with protein foldability, with higher RMSD values indicating lower protein foldability. Furthermore, this correlation is independent of secondary structures, and the RMSD values of computational models are quantitatively correlated with protein foldability but not protein functions. Additionally, using a dataset of 129 de novo designed proteins, we showed that inter-model RMSD values between AlphaFold2 models and RoseTTAFold models are a good indicator of protein foldability. At last, we showed that inter-model RMSD values are also useful for evaluating protein solubility by modeling 1664 natural proteins. Our work could be of great value to the design of novel proteins and the prediction of protein foldability. Research Network of Computational and Structural Biotechnology 2022-08-17 /pmc/articles/PMC9421090/ /pubmed/36051869 http://dx.doi.org/10.1016/j.csbj.2022.08.034 Text en © 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Liu, Sen Wu, Kan Chen, Cheng Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold |
title | Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold |
title_full | Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold |
title_fullStr | Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold |
title_full_unstemmed | Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold |
title_short | Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold |
title_sort | obtaining protein foldability information from computational models of alphafold2 and rosettafold |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421090/ https://www.ncbi.nlm.nih.gov/pubmed/36051869 http://dx.doi.org/10.1016/j.csbj.2022.08.034 |
work_keys_str_mv | AT liusen obtainingproteinfoldabilityinformationfromcomputationalmodelsofalphafold2androsettafold AT wukan obtainingproteinfoldabilityinformationfromcomputationalmodelsofalphafold2androsettafold AT chencheng obtainingproteinfoldabilityinformationfromcomputationalmodelsofalphafold2androsettafold |