Cargando…

Assessing protein model quality based on deep graph coupled networks using protein language model

Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of mul...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Dong, Zhang, Biao, Liu, Jun, Li, Hui, Song, Le, Zhang, Guijun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10685403/
https://www.ncbi.nlm.nih.gov/pubmed/38018909
http://dx.doi.org/10.1093/bib/bbad420
_version_ 1785151622587875328
author Liu, Dong
Zhang, Biao
Liu, Jun
Li, Hui
Song, Le
Zhang, Guijun
author_facet Liu, Dong
Zhang, Biao
Liu, Jun
Li, Hui
Song, Le
Zhang, Guijun
author_sort Liu, Dong
collection PubMed
description Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of multimer models is a hot topic for structure prediction. In this study, we propose GraphCPLMQA, a novel approach for evaluating residue-level model quality that combines graph coupled networks and embeddings from protein language models. The GraphCPLMQA consists of a graph encoding module and a transform-based convolutional decoding module. In encoding module, the underlying relational representations of sequence and high-dimensional geometry structure are extracted by protein language models with Evolutionary Scale Modeling. In decoding module, the mapping connection between structure and quality is inferred by the representations and low-dimensional features. Specifically, the triangular location and residue level contact order features are designed to enhance the association between the local structure and the overall topology. Experimental results demonstrate that GraphCPLMQA using single-sequence embedding achieves the best performance compared with the CASP15 residue-level interface evaluation methods among 9108 models in the local residue interface test set of CASP15 multimers. In CAMEO blind test (20 May 2022 to 13 August 2022), GraphCPLMQA ranked first compared with other servers (https://www.cameo3d.org/quality-estimation). GraphCPLMQA also outperforms state-of-the-art methods on 19, 035 models in CASP13 and CASP14 monomer test set.
format Online
Article
Text
id pubmed-10685403
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106854032023-11-30 Assessing protein model quality based on deep graph coupled networks using protein language model Liu, Dong Zhang, Biao Liu, Jun Li, Hui Song, Le Zhang, Guijun Brief Bioinform Problem Solving Protocol Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of multimer models is a hot topic for structure prediction. In this study, we propose GraphCPLMQA, a novel approach for evaluating residue-level model quality that combines graph coupled networks and embeddings from protein language models. The GraphCPLMQA consists of a graph encoding module and a transform-based convolutional decoding module. In encoding module, the underlying relational representations of sequence and high-dimensional geometry structure are extracted by protein language models with Evolutionary Scale Modeling. In decoding module, the mapping connection between structure and quality is inferred by the representations and low-dimensional features. Specifically, the triangular location and residue level contact order features are designed to enhance the association between the local structure and the overall topology. Experimental results demonstrate that GraphCPLMQA using single-sequence embedding achieves the best performance compared with the CASP15 residue-level interface evaluation methods among 9108 models in the local residue interface test set of CASP15 multimers. In CAMEO blind test (20 May 2022 to 13 August 2022), GraphCPLMQA ranked first compared with other servers (https://www.cameo3d.org/quality-estimation). GraphCPLMQA also outperforms state-of-the-art methods on 19, 035 models in CASP13 and CASP14 monomer test set. Oxford University Press 2023-11-28 /pmc/articles/PMC10685403/ /pubmed/38018909 http://dx.doi.org/10.1093/bib/bbad420 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Liu, Dong
Zhang, Biao
Liu, Jun
Li, Hui
Song, Le
Zhang, Guijun
Assessing protein model quality based on deep graph coupled networks using protein language model
title Assessing protein model quality based on deep graph coupled networks using protein language model
title_full Assessing protein model quality based on deep graph coupled networks using protein language model
title_fullStr Assessing protein model quality based on deep graph coupled networks using protein language model
title_full_unstemmed Assessing protein model quality based on deep graph coupled networks using protein language model
title_short Assessing protein model quality based on deep graph coupled networks using protein language model
title_sort assessing protein model quality based on deep graph coupled networks using protein language model
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10685403/
https://www.ncbi.nlm.nih.gov/pubmed/38018909
http://dx.doi.org/10.1093/bib/bbad420
work_keys_str_mv AT liudong assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel
AT zhangbiao assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel
AT liujun assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel
AT lihui assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel
AT songle assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel
AT zhangguijun assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel