Cargando…

Assessing protein model quality based on deep graph coupled networks using protein language model

Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of mul...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Dong, Zhang, Biao, Liu, Jun, Li, Hui, Song, Le, Zhang, Guijun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Problem Solving Protocol
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10685403/ https://www.ncbi.nlm.nih.gov/pubmed/38018909 http://dx.doi.org/10.1093/bib/bbad420

_version_	1785151622587875328
author	Liu, Dong Zhang, Biao Liu, Jun Li, Hui Song, Le Zhang, Guijun
author_facet	Liu, Dong Zhang, Biao Liu, Jun Li, Hui Song, Le Zhang, Guijun
author_sort	Liu, Dong
collection	PubMed
description	Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of multimer models is a hot topic for structure prediction. In this study, we propose GraphCPLMQA, a novel approach for evaluating residue-level model quality that combines graph coupled networks and embeddings from protein language models. The GraphCPLMQA consists of a graph encoding module and a transform-based convolutional decoding module. In encoding module, the underlying relational representations of sequence and high-dimensional geometry structure are extracted by protein language models with Evolutionary Scale Modeling. In decoding module, the mapping connection between structure and quality is inferred by the representations and low-dimensional features. Specifically, the triangular location and residue level contact order features are designed to enhance the association between the local structure and the overall topology. Experimental results demonstrate that GraphCPLMQA using single-sequence embedding achieves the best performance compared with the CASP15 residue-level interface evaluation methods among 9108 models in the local residue interface test set of CASP15 multimers. In CAMEO blind test (20 May 2022 to 13 August 2022), GraphCPLMQA ranked first compared with other servers (https://www.cameo3d.org/quality-estimation). GraphCPLMQA also outperforms state-of-the-art methods on 19, 035 models in CASP13 and CASP14 monomer test set.
format	Online Article Text
id	pubmed-10685403
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-106854032023-11-30 Assessing protein model quality based on deep graph coupled networks using protein language model Liu, Dong Zhang, Biao Liu, Jun Li, Hui Song, Le Zhang, Guijun Brief Bioinform Problem Solving Protocol Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of multimer models is a hot topic for structure prediction. In this study, we propose GraphCPLMQA, a novel approach for evaluating residue-level model quality that combines graph coupled networks and embeddings from protein language models. The GraphCPLMQA consists of a graph encoding module and a transform-based convolutional decoding module. In encoding module, the underlying relational representations of sequence and high-dimensional geometry structure are extracted by protein language models with Evolutionary Scale Modeling. In decoding module, the mapping connection between structure and quality is inferred by the representations and low-dimensional features. Specifically, the triangular location and residue level contact order features are designed to enhance the association between the local structure and the overall topology. Experimental results demonstrate that GraphCPLMQA using single-sequence embedding achieves the best performance compared with the CASP15 residue-level interface evaluation methods among 9108 models in the local residue interface test set of CASP15 multimers. In CAMEO blind test (20 May 2022 to 13 August 2022), GraphCPLMQA ranked first compared with other servers (https://www.cameo3d.org/quality-estimation). GraphCPLMQA also outperforms state-of-the-art methods on 19, 035 models in CASP13 and CASP14 monomer test set. Oxford University Press 2023-11-28 /pmc/articles/PMC10685403/ /pubmed/38018909 http://dx.doi.org/10.1093/bib/bbad420 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Problem Solving Protocol Liu, Dong Zhang, Biao Liu, Jun Li, Hui Song, Le Zhang, Guijun Assessing protein model quality based on deep graph coupled networks using protein language model
title	Assessing protein model quality based on deep graph coupled networks using protein language model
title_full	Assessing protein model quality based on deep graph coupled networks using protein language model
title_fullStr	Assessing protein model quality based on deep graph coupled networks using protein language model
title_full_unstemmed	Assessing protein model quality based on deep graph coupled networks using protein language model
title_short	Assessing protein model quality based on deep graph coupled networks using protein language model
title_sort	assessing protein model quality based on deep graph coupled networks using protein language model
topic	Problem Solving Protocol
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10685403/ https://www.ncbi.nlm.nih.gov/pubmed/38018909 http://dx.doi.org/10.1093/bib/bbad420
work_keys_str_mv	AT liudong assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel AT zhangbiao assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel AT liujun assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel AT lihui assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel AT songle assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel AT zhangguijun assessingproteinmodelqualitybasedondeepgraphcouplednetworksusingproteinlanguagemodel

Assessing protein model quality based on deep graph coupled networks using protein language model

Ejemplares similares