Cargando…

Detecting distant-homology protein structures by aligning deep neural-network based contact maps

Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural f...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Wei, Wuyun, Qiqige, Li, Yang, Mortuza, S. M., Zhang, Chengxin, Pearce, Robin, Ruan, Jishou, Zhang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6818797/
https://www.ncbi.nlm.nih.gov/pubmed/31622328
http://dx.doi.org/10.1371/journal.pcbi.1007411
_version_ 1783463651629334528
author Zheng, Wei
Wuyun, Qiqige
Li, Yang
Mortuza, S. M.
Zhang, Chengxin
Pearce, Robin
Ruan, Jishou
Zhang, Yang
author_facet Zheng, Wei
Wuyun, Qiqige
Li, Yang
Mortuza, S. M.
Zhang, Chengxin
Pearce, Robin
Ruan, Jishou
Zhang, Yang
author_sort Zheng, Wei
collection PubMed
description Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins.
format Online
Article
Text
id pubmed-6818797
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68187972019-11-02 Detecting distant-homology protein structures by aligning deep neural-network based contact maps Zheng, Wei Wuyun, Qiqige Li, Yang Mortuza, S. M. Zhang, Chengxin Pearce, Robin Ruan, Jishou Zhang, Yang PLoS Comput Biol Research Article Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins. Public Library of Science 2019-10-17 /pmc/articles/PMC6818797/ /pubmed/31622328 http://dx.doi.org/10.1371/journal.pcbi.1007411 Text en © 2019 Zheng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zheng, Wei
Wuyun, Qiqige
Li, Yang
Mortuza, S. M.
Zhang, Chengxin
Pearce, Robin
Ruan, Jishou
Zhang, Yang
Detecting distant-homology protein structures by aligning deep neural-network based contact maps
title Detecting distant-homology protein structures by aligning deep neural-network based contact maps
title_full Detecting distant-homology protein structures by aligning deep neural-network based contact maps
title_fullStr Detecting distant-homology protein structures by aligning deep neural-network based contact maps
title_full_unstemmed Detecting distant-homology protein structures by aligning deep neural-network based contact maps
title_short Detecting distant-homology protein structures by aligning deep neural-network based contact maps
title_sort detecting distant-homology protein structures by aligning deep neural-network based contact maps
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6818797/
https://www.ncbi.nlm.nih.gov/pubmed/31622328
http://dx.doi.org/10.1371/journal.pcbi.1007411
work_keys_str_mv AT zhengwei detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT wuyunqiqige detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT liyang detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT mortuzasm detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT zhangchengxin detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT pearcerobin detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT ruanjishou detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps
AT zhangyang detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps