Cargando…

The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15

In CASP15, 87 predictors submitted around 11,000 models on 41 assembly targets. The community demonstrated exceptional performance in overall fold and interface contact prediction, achieving an impressive success rate of 90% (compared to 31% in CASP14). This remarkable accomplishment is largely due...

Descripción completa

Detalles Bibliográficos
Autores principales: Ozden, Burcu, Kryshtafovych, Andriy, Karaca, Ezgi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369898/
https://www.ncbi.nlm.nih.gov/pubmed/37503072
http://dx.doi.org/10.1101/2023.07.10.548341
_version_ 1785077856729038848
author Ozden, Burcu
Kryshtafovych, Andriy
Karaca, Ezgi
author_facet Ozden, Burcu
Kryshtafovych, Andriy
Karaca, Ezgi
author_sort Ozden, Burcu
collection PubMed
description In CASP15, 87 predictors submitted around 11,000 models on 41 assembly targets. The community demonstrated exceptional performance in overall fold and interface contact prediction, achieving an impressive success rate of 90% (compared to 31% in CASP14). This remarkable accomplishment is largely due to the incorporation of DeepMind’s AF2-Multimer approach into custom-built prediction pipelines. To evaluate the added value of participating methods, we compared the community models to the baseline AF2-Multimer predictor. In over 1/3 of cases the community models were superior to the baseline predictor. The main reasons for this improved performance were the use of custom-built multiple sequence alignments, optimized AF2-Multimer sampling, and the manual assembly of AF2-Multimer-built subcomplexes. The best three groups, in order, are Zheng, Venclovas and Wallner. Zheng and Venclovas reached a 73.2% success rate over all (41) cases, while Wallner attained 69.4% success rate over 36 cases. Nonetheless, challenges remain in predicting structures with weak evolutionary signals, such as nanobody-antigen, antibody-antigen, and viral complexes. Expectedly, modeling large complexes remains also challenging due to their high memory compute demands. In addition to the assembly category, we assessed the accuracy of modeling interdomain interfaces in the tertiary structure prediction targets. Models on seven targets featuring 17 unique interfaces were analyzed. Best predictors achieved the 76.5% success rate, with the UM-TBM group being the leader. In the interdomain category, we observed that the predictors faced challenges, as in the case of the assembly category, when the evolutionary signal for a given domain pair was weak or the structure was large. Overall, CASP15 witnessed unprecedented improvement in interface modeling, reflecting the AI revolution seen in CASP14.
format Online
Article
Text
id pubmed-10369898
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-103698982023-07-27 The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15 Ozden, Burcu Kryshtafovych, Andriy Karaca, Ezgi bioRxiv Article In CASP15, 87 predictors submitted around 11,000 models on 41 assembly targets. The community demonstrated exceptional performance in overall fold and interface contact prediction, achieving an impressive success rate of 90% (compared to 31% in CASP14). This remarkable accomplishment is largely due to the incorporation of DeepMind’s AF2-Multimer approach into custom-built prediction pipelines. To evaluate the added value of participating methods, we compared the community models to the baseline AF2-Multimer predictor. In over 1/3 of cases the community models were superior to the baseline predictor. The main reasons for this improved performance were the use of custom-built multiple sequence alignments, optimized AF2-Multimer sampling, and the manual assembly of AF2-Multimer-built subcomplexes. The best three groups, in order, are Zheng, Venclovas and Wallner. Zheng and Venclovas reached a 73.2% success rate over all (41) cases, while Wallner attained 69.4% success rate over 36 cases. Nonetheless, challenges remain in predicting structures with weak evolutionary signals, such as nanobody-antigen, antibody-antigen, and viral complexes. Expectedly, modeling large complexes remains also challenging due to their high memory compute demands. In addition to the assembly category, we assessed the accuracy of modeling interdomain interfaces in the tertiary structure prediction targets. Models on seven targets featuring 17 unique interfaces were analyzed. Best predictors achieved the 76.5% success rate, with the UM-TBM group being the leader. In the interdomain category, we observed that the predictors faced challenges, as in the case of the assembly category, when the evolutionary signal for a given domain pair was weak or the structure was large. Overall, CASP15 witnessed unprecedented improvement in interface modeling, reflecting the AI revolution seen in CASP14. Cold Spring Harbor Laboratory 2023-09-19 /pmc/articles/PMC10369898/ /pubmed/37503072 http://dx.doi.org/10.1101/2023.07.10.548341 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Ozden, Burcu
Kryshtafovych, Andriy
Karaca, Ezgi
The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15
title The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15
title_full The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15
title_fullStr The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15
title_full_unstemmed The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15
title_short The Impact of AI-Based Modeling on the Accuracy of Protein Assembly Prediction: Insights from CASP15
title_sort impact of ai-based modeling on the accuracy of protein assembly prediction: insights from casp15
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369898/
https://www.ncbi.nlm.nih.gov/pubmed/37503072
http://dx.doi.org/10.1101/2023.07.10.548341
work_keys_str_mv AT ozdenburcu theimpactofaibasedmodelingontheaccuracyofproteinassemblypredictioninsightsfromcasp15
AT kryshtafovychandriy theimpactofaibasedmodelingontheaccuracyofproteinassemblypredictioninsightsfromcasp15
AT karacaezgi theimpactofaibasedmodelingontheaccuracyofproteinassemblypredictioninsightsfromcasp15
AT ozdenburcu impactofaibasedmodelingontheaccuracyofproteinassemblypredictioninsightsfromcasp15
AT kryshtafovychandriy impactofaibasedmodelingontheaccuracyofproteinassemblypredictioninsightsfromcasp15
AT karacaezgi impactofaibasedmodelingontheaccuracyofproteinassemblypredictioninsightsfromcasp15