Cargando…
Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction
The inhibition of protein–protein interactions is a growing strategy in drug development. In addition to structured regions, many protein loop regions are involved in protein–protein interactions and thus have been identified as potential drug targets. To effectively target such regions, protein str...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9312937/ https://www.ncbi.nlm.nih.gov/pubmed/35883541 http://dx.doi.org/10.3390/biom12070985 |
_version_ | 1784753955645947904 |
---|---|
author | Stevens, Amy O. He, Yi |
author_facet | Stevens, Amy O. He, Yi |
author_sort | Stevens, Amy O. |
collection | PubMed |
description | The inhibition of protein–protein interactions is a growing strategy in drug development. In addition to structured regions, many protein loop regions are involved in protein–protein interactions and thus have been identified as potential drug targets. To effectively target such regions, protein structure is critical. Loop structure prediction is a challenging subgroup in the field of protein structure prediction because of the reduced level of conservation in protein sequences compared to the secondary structure elements. AlphaFold 2 has been suggested to be one of the greatest achievements in the field of protein structure prediction. The AlphaFold 2 predicted protein structures near the X-ray resolution in the Critical Assessment of protein Structure Prediction (CASP 14) competition in 2020. The purpose of this work is to survey the performance of AlphaFold 2 in specifically predicting protein loop regions. We have constructed an independent dataset of 31,650 loop regions from 2613 proteins (deposited after the AlphaFold 2 was trained) with both experimentally determined structures and AlphaFold 2 predicted structures. With extensive evaluation using our dataset, the results indicate that AlphaFold 2 is a good predictor of the structure of loop regions, especially for short loop regions. Loops less than 10 residues in length have an average Root Mean Square Deviation (RMSD) of 0.33 Å and an average the Template Modeling score (TM-score) of 0.82. However, we see that as the number of residues in a given loop increases, the accuracy of AlphaFold 2’s prediction decreases. Loops more than 20 residues in length have an average RMSD of 2.04 Å and an average TM-score of 0.55. Such a correlation between accuracy and length of the loop is directly linked to the increase in flexibility. Moreover, AlphaFold 2 does slightly over-predict [Formula: see text]-helices and [Formula: see text]-strands in proteins. |
format | Online Article Text |
id | pubmed-9312937 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93129372022-07-26 Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction Stevens, Amy O. He, Yi Biomolecules Article The inhibition of protein–protein interactions is a growing strategy in drug development. In addition to structured regions, many protein loop regions are involved in protein–protein interactions and thus have been identified as potential drug targets. To effectively target such regions, protein structure is critical. Loop structure prediction is a challenging subgroup in the field of protein structure prediction because of the reduced level of conservation in protein sequences compared to the secondary structure elements. AlphaFold 2 has been suggested to be one of the greatest achievements in the field of protein structure prediction. The AlphaFold 2 predicted protein structures near the X-ray resolution in the Critical Assessment of protein Structure Prediction (CASP 14) competition in 2020. The purpose of this work is to survey the performance of AlphaFold 2 in specifically predicting protein loop regions. We have constructed an independent dataset of 31,650 loop regions from 2613 proteins (deposited after the AlphaFold 2 was trained) with both experimentally determined structures and AlphaFold 2 predicted structures. With extensive evaluation using our dataset, the results indicate that AlphaFold 2 is a good predictor of the structure of loop regions, especially for short loop regions. Loops less than 10 residues in length have an average Root Mean Square Deviation (RMSD) of 0.33 Å and an average the Template Modeling score (TM-score) of 0.82. However, we see that as the number of residues in a given loop increases, the accuracy of AlphaFold 2’s prediction decreases. Loops more than 20 residues in length have an average RMSD of 2.04 Å and an average TM-score of 0.55. Such a correlation between accuracy and length of the loop is directly linked to the increase in flexibility. Moreover, AlphaFold 2 does slightly over-predict [Formula: see text]-helices and [Formula: see text]-strands in proteins. MDPI 2022-07-14 /pmc/articles/PMC9312937/ /pubmed/35883541 http://dx.doi.org/10.3390/biom12070985 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Stevens, Amy O. He, Yi Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction |
title | Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction |
title_full | Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction |
title_fullStr | Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction |
title_full_unstemmed | Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction |
title_short | Benchmarking the Accuracy of AlphaFold 2 in Loop Structure Prediction |
title_sort | benchmarking the accuracy of alphafold 2 in loop structure prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9312937/ https://www.ncbi.nlm.nih.gov/pubmed/35883541 http://dx.doi.org/10.3390/biom12070985 |
work_keys_str_mv | AT stevensamyo benchmarkingtheaccuracyofalphafold2inloopstructureprediction AT heyi benchmarkingtheaccuracyofalphafold2inloopstructureprediction |