Cargando…

Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15

AlphaFold-Multimer has emerged as the state-of-the-art tool for predicting the quaternary structure of protein complexes (assemblies or multimers) since its release in 2021. To further enhance the AlphaFold-Multimer-based complex structure prediction, we developed a new quaternary structure predicti...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jian, Guo, Zhiye, Wu, Tianqi, Roy, Raj S., Quadir, Farhan, Chen, Chen, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245707/
https://www.ncbi.nlm.nih.gov/pubmed/37293073
http://dx.doi.org/10.1101/2023.05.16.541055
_version_ 1785054912510427136
author Liu, Jian
Guo, Zhiye
Wu, Tianqi
Roy, Raj S.
Quadir, Farhan
Chen, Chen
Cheng, Jianlin
author_facet Liu, Jian
Guo, Zhiye
Wu, Tianqi
Roy, Raj S.
Quadir, Farhan
Chen, Chen
Cheng, Jianlin
author_sort Liu, Jian
collection PubMed
description AlphaFold-Multimer has emerged as the state-of-the-art tool for predicting the quaternary structure of protein complexes (assemblies or multimers) since its release in 2021. To further enhance the AlphaFold-Multimer-based complex structure prediction, we developed a new quaternary structure prediction system (MULTICOM) to improve the input fed to AlphaFold-Multimer and evaluate and refine the outputs generated by AlphaFold2-Multimer. Specifically, MULTICOM samples diverse multiple sequence alignments (MSAs) and templates for AlphaFold-Multimer to generate structural models by using both traditional sequence alignments and new Foldseek-based structure alignments, ranks structural models through multiple complementary metrics, and refines the structural models via a Foldseek structure alignment-based refinement method. The MULTICOM system with different implementations was blindly tested in the assembly structure prediction in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 as both server and human predictors. Our server (MULTICOM_qa) ranked 3(rd) among 26 CASP15 server predictors and our human predictor (MULTICOM_human) ranked 7(th) among 87 CASP15 server and human predictors. The average TM-score of the first models predicted by MULTICOM_qa for CASP15 assembly targets is ~0.76, 5.3% higher than ~0.72 of the standard AlphaFold-Multimer. The average TM-score of the best of top 5 models predicted by MULTICOM_qa is ~0.80, about 8% higher than ~0.74 of the standard AlphaFold-Multimer. Moreover, the novel Foldseek Structure Alignment-based Model Generation (FSAMG) method based on AlphaFold-Multimer outperforms the widely used sequence alignment-based model generation. The source code of MULTICOM is available at: https://github.com/BioinfoMachineLearning/MULTICOM3.
format Online
Article
Text
id pubmed-10245707
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-102457072023-06-08 Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15 Liu, Jian Guo, Zhiye Wu, Tianqi Roy, Raj S. Quadir, Farhan Chen, Chen Cheng, Jianlin bioRxiv Article AlphaFold-Multimer has emerged as the state-of-the-art tool for predicting the quaternary structure of protein complexes (assemblies or multimers) since its release in 2021. To further enhance the AlphaFold-Multimer-based complex structure prediction, we developed a new quaternary structure prediction system (MULTICOM) to improve the input fed to AlphaFold-Multimer and evaluate and refine the outputs generated by AlphaFold2-Multimer. Specifically, MULTICOM samples diverse multiple sequence alignments (MSAs) and templates for AlphaFold-Multimer to generate structural models by using both traditional sequence alignments and new Foldseek-based structure alignments, ranks structural models through multiple complementary metrics, and refines the structural models via a Foldseek structure alignment-based refinement method. The MULTICOM system with different implementations was blindly tested in the assembly structure prediction in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 as both server and human predictors. Our server (MULTICOM_qa) ranked 3(rd) among 26 CASP15 server predictors and our human predictor (MULTICOM_human) ranked 7(th) among 87 CASP15 server and human predictors. The average TM-score of the first models predicted by MULTICOM_qa for CASP15 assembly targets is ~0.76, 5.3% higher than ~0.72 of the standard AlphaFold-Multimer. The average TM-score of the best of top 5 models predicted by MULTICOM_qa is ~0.80, about 8% higher than ~0.74 of the standard AlphaFold-Multimer. Moreover, the novel Foldseek Structure Alignment-based Model Generation (FSAMG) method based on AlphaFold-Multimer outperforms the widely used sequence alignment-based model generation. The source code of MULTICOM is available at: https://github.com/BioinfoMachineLearning/MULTICOM3. Cold Spring Harbor Laboratory 2023-05-18 /pmc/articles/PMC10245707/ /pubmed/37293073 http://dx.doi.org/10.1101/2023.05.16.541055 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Liu, Jian
Guo, Zhiye
Wu, Tianqi
Roy, Raj S.
Quadir, Farhan
Chen, Chen
Cheng, Jianlin
Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15
title Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15
title_full Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15
title_fullStr Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15
title_full_unstemmed Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15
title_short Enhancing AlphaFold-Multimer-based Protein Complex Structure Prediction with MULTICOM in CASP15
title_sort enhancing alphafold-multimer-based protein complex structure prediction with multicom in casp15
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245707/
https://www.ncbi.nlm.nih.gov/pubmed/37293073
http://dx.doi.org/10.1101/2023.05.16.541055
work_keys_str_mv AT liujian enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15
AT guozhiye enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15
AT wutianqi enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15
AT royrajs enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15
AT quadirfarhan enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15
AT chenchen enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15
AT chengjianlin enhancingalphafoldmultimerbasedproteincomplexstructurepredictionwithmulticomincasp15