Cargando…
Detecting distant-homology protein structures by aligning deep neural-network based contact maps
Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural f...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6818797/ https://www.ncbi.nlm.nih.gov/pubmed/31622328 http://dx.doi.org/10.1371/journal.pcbi.1007411 |
_version_ | 1783463651629334528 |
---|---|
author | Zheng, Wei Wuyun, Qiqige Li, Yang Mortuza, S. M. Zhang, Chengxin Pearce, Robin Ruan, Jishou Zhang, Yang |
author_facet | Zheng, Wei Wuyun, Qiqige Li, Yang Mortuza, S. M. Zhang, Chengxin Pearce, Robin Ruan, Jishou Zhang, Yang |
author_sort | Zheng, Wei |
collection | PubMed |
description | Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins. |
format | Online Article Text |
id | pubmed-6818797 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-68187972019-11-02 Detecting distant-homology protein structures by aligning deep neural-network based contact maps Zheng, Wei Wuyun, Qiqige Li, Yang Mortuza, S. M. Zhang, Chengxin Pearce, Robin Ruan, Jishou Zhang, Yang PLoS Comput Biol Research Article Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins. Public Library of Science 2019-10-17 /pmc/articles/PMC6818797/ /pubmed/31622328 http://dx.doi.org/10.1371/journal.pcbi.1007411 Text en © 2019 Zheng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zheng, Wei Wuyun, Qiqige Li, Yang Mortuza, S. M. Zhang, Chengxin Pearce, Robin Ruan, Jishou Zhang, Yang Detecting distant-homology protein structures by aligning deep neural-network based contact maps |
title | Detecting distant-homology protein structures by aligning deep neural-network based contact maps |
title_full | Detecting distant-homology protein structures by aligning deep neural-network based contact maps |
title_fullStr | Detecting distant-homology protein structures by aligning deep neural-network based contact maps |
title_full_unstemmed | Detecting distant-homology protein structures by aligning deep neural-network based contact maps |
title_short | Detecting distant-homology protein structures by aligning deep neural-network based contact maps |
title_sort | detecting distant-homology protein structures by aligning deep neural-network based contact maps |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6818797/ https://www.ncbi.nlm.nih.gov/pubmed/31622328 http://dx.doi.org/10.1371/journal.pcbi.1007411 |
work_keys_str_mv | AT zhengwei detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT wuyunqiqige detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT liyang detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT mortuzasm detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT zhangchengxin detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT pearcerobin detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT ruanjishou detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps AT zhangyang detectingdistanthomologyproteinstructuresbyaligningdeepneuralnetworkbasedcontactmaps |