Cargando…

Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold

The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., if a protein can really fold into the predicted 3D structure. We wondered if the computational models f...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Sen, Wu, Kan, Chen, Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421090/
https://www.ncbi.nlm.nih.gov/pubmed/36051869
http://dx.doi.org/10.1016/j.csbj.2022.08.034
_version_ 1784777518470922240
author Liu, Sen
Wu, Kan
Chen, Cheng
author_facet Liu, Sen
Wu, Kan
Chen, Cheng
author_sort Liu, Sen
collection PubMed
description The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., if a protein can really fold into the predicted 3D structure. We wondered if the computational models from AlphaFold2 and RoseTTAFold might carry protein foldability information. Therefore, we predicted the structural models of 159 circular permutants and 158 alanine insertion mutants of the 159-residue dihydrofolate reductase. Our data showed that although AlphaFold2 and RoseTTAFold cannot directly identify unfoldable proteins, the RMSD values of computational models are correlated with protein foldability, with higher RMSD values indicating lower protein foldability. Furthermore, this correlation is independent of secondary structures, and the RMSD values of computational models are quantitatively correlated with protein foldability but not protein functions. Additionally, using a dataset of 129 de novo designed proteins, we showed that inter-model RMSD values between AlphaFold2 models and RoseTTAFold models are a good indicator of protein foldability. At last, we showed that inter-model RMSD values are also useful for evaluating protein solubility by modeling 1664 natural proteins. Our work could be of great value to the design of novel proteins and the prediction of protein foldability.
format Online
Article
Text
id pubmed-9421090
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-94210902022-08-31 Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold Liu, Sen Wu, Kan Chen, Cheng Comput Struct Biotechnol J Research Article The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., if a protein can really fold into the predicted 3D structure. We wondered if the computational models from AlphaFold2 and RoseTTAFold might carry protein foldability information. Therefore, we predicted the structural models of 159 circular permutants and 158 alanine insertion mutants of the 159-residue dihydrofolate reductase. Our data showed that although AlphaFold2 and RoseTTAFold cannot directly identify unfoldable proteins, the RMSD values of computational models are correlated with protein foldability, with higher RMSD values indicating lower protein foldability. Furthermore, this correlation is independent of secondary structures, and the RMSD values of computational models are quantitatively correlated with protein foldability but not protein functions. Additionally, using a dataset of 129 de novo designed proteins, we showed that inter-model RMSD values between AlphaFold2 models and RoseTTAFold models are a good indicator of protein foldability. At last, we showed that inter-model RMSD values are also useful for evaluating protein solubility by modeling 1664 natural proteins. Our work could be of great value to the design of novel proteins and the prediction of protein foldability. Research Network of Computational and Structural Biotechnology 2022-08-17 /pmc/articles/PMC9421090/ /pubmed/36051869 http://dx.doi.org/10.1016/j.csbj.2022.08.034 Text en © 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Liu, Sen
Wu, Kan
Chen, Cheng
Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
title Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
title_full Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
title_fullStr Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
title_full_unstemmed Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
title_short Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold
title_sort obtaining protein foldability information from computational models of alphafold2 and rosettafold
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9421090/
https://www.ncbi.nlm.nih.gov/pubmed/36051869
http://dx.doi.org/10.1016/j.csbj.2022.08.034
work_keys_str_mv AT liusen obtainingproteinfoldabilityinformationfromcomputationalmodelsofalphafold2androsettafold
AT wukan obtainingproteinfoldabilityinformationfromcomputationalmodelsofalphafold2androsettafold
AT chencheng obtainingproteinfoldabilityinformationfromcomputationalmodelsofalphafold2androsettafold