Cargando…

Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15

Estimating the accuracy of quaternary structural models of protein complexes and assemblies (EMA) is important for predicting quaternary structures and applying them to studying protein function and interaction. The pairwise similarity between structural models is proven useful for estimating the qu...

Descripción completa

Detalles Bibliográficos
Autores principales: Roy, Raj S., Liu, Jian, Giri, Nabin, Guo, Zhiye, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028888/
https://www.ncbi.nlm.nih.gov/pubmed/36945536
http://dx.doi.org/10.1101/2023.03.08.531814
_version_ 1784910038327885824
author Roy, Raj S.
Liu, Jian
Giri, Nabin
Guo, Zhiye
Cheng, Jianlin
author_facet Roy, Raj S.
Liu, Jian
Giri, Nabin
Guo, Zhiye
Cheng, Jianlin
author_sort Roy, Raj S.
collection PubMed
description Estimating the accuracy of quaternary structural models of protein complexes and assemblies (EMA) is important for predicting quaternary structures and applying them to studying protein function and interaction. The pairwise similarity between structural models is proven useful for estimating the quality of protein tertiary structural models, but it has been rarely applied to predicting the quality of quaternary structural models. Moreover, the pairwise similarity approach often fails when many structural models are of low quality and similar to each other. To address the gap, we developed a hybrid method (MULTICOM_qa) combining a pairwise similarity score (PSS) and an interface contact probability score (ICPS) based on the deep learning inter-chain contact prediction for estimating protein complex model accuracy. It blindly participated in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 and ranked first out of 24 predictors in estimating the global accuracy of assembly models. The average per-target correlation coefficient between the model quality scores predicted by MULTICOM_qa and the true quality scores of the models of CASP15 assembly targets is 0.66. The average per-target ranking loss in using the predicted quality scores to rank the models is 0.14. It was able to select good models for most targets. Moreover, several key factors (i.e., target difficulty, model sampling difficulty, skewness of model quality, and similarity between good/bad models) for EMA are identified and analayzed. The results demonstrate that combining the multi-model method (PSS) with the complementary single-model method (ICPS) is a promising approach to EMA. The source code of MULTICOM_qa is available at https://github.com/BioinfoMachineLearning/MULTICOM_qa.
format Online
Article
Text
id pubmed-10028888
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100288882023-03-22 Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15 Roy, Raj S. Liu, Jian Giri, Nabin Guo, Zhiye Cheng, Jianlin bioRxiv Article Estimating the accuracy of quaternary structural models of protein complexes and assemblies (EMA) is important for predicting quaternary structures and applying them to studying protein function and interaction. The pairwise similarity between structural models is proven useful for estimating the quality of protein tertiary structural models, but it has been rarely applied to predicting the quality of quaternary structural models. Moreover, the pairwise similarity approach often fails when many structural models are of low quality and similar to each other. To address the gap, we developed a hybrid method (MULTICOM_qa) combining a pairwise similarity score (PSS) and an interface contact probability score (ICPS) based on the deep learning inter-chain contact prediction for estimating protein complex model accuracy. It blindly participated in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 and ranked first out of 24 predictors in estimating the global accuracy of assembly models. The average per-target correlation coefficient between the model quality scores predicted by MULTICOM_qa and the true quality scores of the models of CASP15 assembly targets is 0.66. The average per-target ranking loss in using the predicted quality scores to rank the models is 0.14. It was able to select good models for most targets. Moreover, several key factors (i.e., target difficulty, model sampling difficulty, skewness of model quality, and similarity between good/bad models) for EMA are identified and analayzed. The results demonstrate that combining the multi-model method (PSS) with the complementary single-model method (ICPS) is a promising approach to EMA. The source code of MULTICOM_qa is available at https://github.com/BioinfoMachineLearning/MULTICOM_qa. Cold Spring Harbor Laboratory 2023-03-12 /pmc/articles/PMC10028888/ /pubmed/36945536 http://dx.doi.org/10.1101/2023.03.08.531814 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Roy, Raj S.
Liu, Jian
Giri, Nabin
Guo, Zhiye
Cheng, Jianlin
Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
title Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
title_full Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
title_fullStr Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
title_full_unstemmed Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
title_short Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15
title_sort combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in casp15
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028888/
https://www.ncbi.nlm.nih.gov/pubmed/36945536
http://dx.doi.org/10.1101/2023.03.08.531814
work_keys_str_mv AT royrajs combiningpairwisestructuralsimilarityanddeeplearninginterfacecontactpredictiontoestimateproteincomplexmodelaccuracyincasp15
AT liujian combiningpairwisestructuralsimilarityanddeeplearninginterfacecontactpredictiontoestimateproteincomplexmodelaccuracyincasp15
AT girinabin combiningpairwisestructuralsimilarityanddeeplearninginterfacecontactpredictiontoestimateproteincomplexmodelaccuracyincasp15
AT guozhiye combiningpairwisestructuralsimilarityanddeeplearninginterfacecontactpredictiontoestimateproteincomplexmodelaccuracyincasp15
AT chengjianlin combiningpairwisestructuralsimilarityanddeeplearninginterfacecontactpredictiontoestimateproteincomplexmodelaccuracyincasp15