Cargando…

Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction

The inhibition of protein–protein interactions is a growing strategy in drug development. In addition to structured regions, many protein loop regions are involved in protein–protein interactions and thus have been identified as potential drug targets. To effectively target such regions, protein str...

Descripción completa

Detalles Bibliográficos
Autores principales: Stevens, Amy O., He, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9312937/
https://www.ncbi.nlm.nih.gov/pubmed/35883541
http://dx.doi.org/10.3390/biom12070985
_version_ 1784753955645947904
author Stevens, Amy O.
He, Yi
author_facet Stevens, Amy O.
He, Yi
author_sort Stevens, Amy O.
collection PubMed
description The inhibition of protein–protein interactions is a growing strategy in drug development. In addition to structured regions, many protein loop regions are involved in protein–protein interactions and thus have been identified as potential drug targets. To effectively target such regions, protein structure is critical. Loop structure prediction is a challenging subgroup in the field of protein structure prediction because of the reduced level of conservation in protein sequences compared to the secondary structure elements. AlphaFold 2 has been suggested to be one of the greatest achievements in the field of protein structure prediction. The AlphaFold 2 predicted protein structures near the X-ray resolution in the Critical Assessment of protein Structure Prediction (CASP 14) competition in 2020. The purpose of this work is to survey the performance of AlphaFold 2 in specifically predicting protein loop regions. We have constructed an independent dataset of 31,650 loop regions from 2613 proteins (deposited after the AlphaFold 2 was trained) with both experimentally determined structures and AlphaFold 2 predicted structures. With extensive evaluation using our dataset, the results indicate that AlphaFold 2 is a good predictor of the structure of loop regions, especially for short loop regions. Loops less than 10 residues in length have an average Root Mean Square Deviation (RMSD) of 0.33 Å and an average the Template Modeling score (TM-score) of 0.82. However, we see that as the number of residues in a given loop increases, the accuracy of AlphaFold 2’s prediction decreases. Loops more than 20 residues in length have an average RMSD of 2.04 Å and an average TM-score of 0.55. Such a correlation between accuracy and length of the loop is directly linked to the increase in flexibility. Moreover, AlphaFold 2 does slightly over-predict [Formula: see text]-helices and [Formula: see text]-strands in proteins.
format Online
Article
Text
id pubmed-9312937
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93129372022-07-26 Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction Stevens, Amy O. He, Yi Biomolecules Article The inhibition of protein–protein interactions is a growing strategy in drug development. In addition to structured regions, many protein loop regions are involved in protein–protein interactions and thus have been identified as potential drug targets. To effectively target such regions, protein structure is critical. Loop structure prediction is a challenging subgroup in the field of protein structure prediction because of the reduced level of conservation in protein sequences compared to the secondary structure elements. AlphaFold 2 has been suggested to be one of the greatest achievements in the field of protein structure prediction. The AlphaFold 2 predicted protein structures near the X-ray resolution in the Critical Assessment of protein Structure Prediction (CASP 14) competition in 2020. The purpose of this work is to survey the performance of AlphaFold 2 in specifically predicting protein loop regions. We have constructed an independent dataset of 31,650 loop regions from 2613 proteins (deposited after the AlphaFold 2 was trained) with both experimentally determined structures and AlphaFold 2 predicted structures. With extensive evaluation using our dataset, the results indicate that AlphaFold 2 is a good predictor of the structure of loop regions, especially for short loop regions. Loops less than 10 residues in length have an average Root Mean Square Deviation (RMSD) of 0.33 Å and an average the Template Modeling score (TM-score) of 0.82. However, we see that as the number of residues in a given loop increases, the accuracy of AlphaFold 2’s prediction decreases. Loops more than 20 residues in length have an average RMSD of 2.04 Å and an average TM-score of 0.55. Such a correlation between accuracy and length of the loop is directly linked to the increase in flexibility. Moreover, AlphaFold 2 does slightly over-predict [Formula: see text]-helices and [Formula: see text]-strands in proteins. MDPI 2022-07-14 /pmc/articles/PMC9312937/ /pubmed/35883541 http://dx.doi.org/10.3390/biom12070985 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Stevens, Amy O.
He, Yi
Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
title Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
title_full Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
title_fullStr Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
title_full_unstemmed Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
title_short Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
title_sort benchmarking the accuracy of alphafold 2 in loop structure prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9312937/
https://www.ncbi.nlm.nih.gov/pubmed/35883541
http://dx.doi.org/10.3390/biom12070985
work_keys_str_mv AT stevensamyo benchmarkingtheaccuracyofalphafold2inloopstructureprediction
AT heyi benchmarkingtheaccuracyofalphafold2inloopstructureprediction