Cargando…

Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14

The inter-residue contact prediction and deep learning showed the promise to improve the estimation of protein model accuracy (EMA) in the 13th Critical Assessment of Protein Structure Prediction (CASP13). To further leverage the improved inter-residue distance predictions to enhance EMA, during the...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xiao, Liu, Jian, Guo, Zhiye, Wu, Tianqi, Hou, Jie, Cheng, Jianlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149836/
https://www.ncbi.nlm.nih.gov/pubmed/34035363
http://dx.doi.org/10.1038/s41598-021-90303-6
_version_ 1783698031633235968
author Chen, Xiao
Liu, Jian
Guo, Zhiye
Wu, Tianqi
Hou, Jie
Cheng, Jianlin
author_facet Chen, Xiao
Liu, Jian
Guo, Zhiye
Wu, Tianqi
Hou, Jie
Cheng, Jianlin
author_sort Chen, Xiao
collection PubMed
description The inter-residue contact prediction and deep learning showed the promise to improve the estimation of protein model accuracy (EMA) in the 13th Critical Assessment of Protein Structure Prediction (CASP13). To further leverage the improved inter-residue distance predictions to enhance EMA, during the 2020 CASP14 experiment, we integrated several new inter-residue distance features with the existing model quality assessment features in several deep learning methods to predict the quality of protein structural models. According to the evaluation of performance in selecting the best model from the models of CASP14 targets, our three multi-model predictors of estimating model accuracy (MULTICOM-CONSTRUCT, MULTICOM-AI, and MULTICOM-CLUSTER) achieve the averaged loss of 0.073, 0.079, and 0.081, respectively, in terms of the global distance test score (GDT-TS). The three methods are ranked first, second, and third out of all 68 CASP14 predictors. MULTICOM-DEEP, the single-model predictor of estimating model accuracy (EMA), is ranked within top 10 among all the single-model EMA methods according to GDT-TS score loss. The results demonstrate that inter-residue distance features are valuable inputs for deep learning to predict the quality of protein structural models. However, larger training datasets and better ways of leveraging inter-residue distance information are needed to fully explore its potentials.
format Online
Article
Text
id pubmed-8149836
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81498362021-05-26 Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14 Chen, Xiao Liu, Jian Guo, Zhiye Wu, Tianqi Hou, Jie Cheng, Jianlin Sci Rep Article The inter-residue contact prediction and deep learning showed the promise to improve the estimation of protein model accuracy (EMA) in the 13th Critical Assessment of Protein Structure Prediction (CASP13). To further leverage the improved inter-residue distance predictions to enhance EMA, during the 2020 CASP14 experiment, we integrated several new inter-residue distance features with the existing model quality assessment features in several deep learning methods to predict the quality of protein structural models. According to the evaluation of performance in selecting the best model from the models of CASP14 targets, our three multi-model predictors of estimating model accuracy (MULTICOM-CONSTRUCT, MULTICOM-AI, and MULTICOM-CLUSTER) achieve the averaged loss of 0.073, 0.079, and 0.081, respectively, in terms of the global distance test score (GDT-TS). The three methods are ranked first, second, and third out of all 68 CASP14 predictors. MULTICOM-DEEP, the single-model predictor of estimating model accuracy (EMA), is ranked within top 10 among all the single-model EMA methods according to GDT-TS score loss. The results demonstrate that inter-residue distance features are valuable inputs for deep learning to predict the quality of protein structural models. However, larger training datasets and better ways of leveraging inter-residue distance information are needed to fully explore its potentials. Nature Publishing Group UK 2021-05-25 /pmc/articles/PMC8149836/ /pubmed/34035363 http://dx.doi.org/10.1038/s41598-021-90303-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chen, Xiao
Liu, Jian
Guo, Zhiye
Wu, Tianqi
Hou, Jie
Cheng, Jianlin
Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
title Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
title_full Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
title_fullStr Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
title_full_unstemmed Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
title_short Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
title_sort protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in casp14
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8149836/
https://www.ncbi.nlm.nih.gov/pubmed/34035363
http://dx.doi.org/10.1038/s41598-021-90303-6
work_keys_str_mv AT chenxiao proteinmodelaccuracyestimationempoweredbydeeplearningandinterresiduedistancepredictionincasp14
AT liujian proteinmodelaccuracyestimationempoweredbydeeplearningandinterresiduedistancepredictionincasp14
AT guozhiye proteinmodelaccuracyestimationempoweredbydeeplearningandinterresiduedistancepredictionincasp14
AT wutianqi proteinmodelaccuracyestimationempoweredbydeeplearningandinterresiduedistancepredictionincasp14
AT houjie proteinmodelaccuracyestimationempoweredbydeeplearningandinterresiduedistancepredictionincasp14
AT chengjianlin proteinmodelaccuracyestimationempoweredbydeeplearningandinterresiduedistancepredictionincasp14