Cargando…
CRHunter: integrating multifaceted information to predict catalytic residues in enzymes
A variety of algorithms have been developed for catalytic residue prediction based on either feature- or template-based methodology. However, no studies have systematically compared these two strategies and further considered whether their combination could improve the prediction performance. Herein...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5036049/ https://www.ncbi.nlm.nih.gov/pubmed/27665935 http://dx.doi.org/10.1038/srep34044 |
_version_ | 1782455484011249664 |
---|---|
author | Sun, Jun Wang, Jia Xiong, Dan Hu, Jian Liu, Rong |
author_facet | Sun, Jun Wang, Jia Xiong, Dan Hu, Jian Liu, Rong |
author_sort | Sun, Jun |
collection | PubMed |
description | A variety of algorithms have been developed for catalytic residue prediction based on either feature- or template-based methodology. However, no studies have systematically compared these two strategies and further considered whether their combination could improve the prediction performance. Herein, we developed an integrative algorithm named CRHunter by simultaneously using the complementarity between feature- and template-based methodologies and that between structural and sequence information. Several novel structural features were generated by the Delaunay triangulation and Laplacian transformation of enzyme structures. Combining these features with traditional descriptors, we invented two support vector machine feature predictors based on both structural and sequence information. Furthermore, we established two template predictors using structure and profile alignments. Evaluated on datasets with different levels of homology, our feature predictors achieve relatively stable performance, whereas our template predictors yield poor results when the homological relationships become weak. Nevertheless, the hybrid algorithm CRHunter consistently achieves optimal performance among all our predictors. We also illustrate that our methodology can be applied to the predicted structures of enzymes. Compared with state-of-the-art methods, CRHunter yields comparable or better performance on various datasets. Finally, the application of this algorithm to structural genomics targets sheds light on solved protein structures with unknown functions. |
format | Online Article Text |
id | pubmed-5036049 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-50360492016-09-30 CRHunter: integrating multifaceted information to predict catalytic residues in enzymes Sun, Jun Wang, Jia Xiong, Dan Hu, Jian Liu, Rong Sci Rep Article A variety of algorithms have been developed for catalytic residue prediction based on either feature- or template-based methodology. However, no studies have systematically compared these two strategies and further considered whether their combination could improve the prediction performance. Herein, we developed an integrative algorithm named CRHunter by simultaneously using the complementarity between feature- and template-based methodologies and that between structural and sequence information. Several novel structural features were generated by the Delaunay triangulation and Laplacian transformation of enzyme structures. Combining these features with traditional descriptors, we invented two support vector machine feature predictors based on both structural and sequence information. Furthermore, we established two template predictors using structure and profile alignments. Evaluated on datasets with different levels of homology, our feature predictors achieve relatively stable performance, whereas our template predictors yield poor results when the homological relationships become weak. Nevertheless, the hybrid algorithm CRHunter consistently achieves optimal performance among all our predictors. We also illustrate that our methodology can be applied to the predicted structures of enzymes. Compared with state-of-the-art methods, CRHunter yields comparable or better performance on various datasets. Finally, the application of this algorithm to structural genomics targets sheds light on solved protein structures with unknown functions. Nature Publishing Group 2016-09-26 /pmc/articles/PMC5036049/ /pubmed/27665935 http://dx.doi.org/10.1038/srep34044 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Sun, Jun Wang, Jia Xiong, Dan Hu, Jian Liu, Rong CRHunter: integrating multifaceted information to predict catalytic residues in enzymes |
title | CRHunter: integrating multifaceted information to predict catalytic residues in enzymes |
title_full | CRHunter: integrating multifaceted information to predict catalytic residues in enzymes |
title_fullStr | CRHunter: integrating multifaceted information to predict catalytic residues in enzymes |
title_full_unstemmed | CRHunter: integrating multifaceted information to predict catalytic residues in enzymes |
title_short | CRHunter: integrating multifaceted information to predict catalytic residues in enzymes |
title_sort | crhunter: integrating multifaceted information to predict catalytic residues in enzymes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5036049/ https://www.ncbi.nlm.nih.gov/pubmed/27665935 http://dx.doi.org/10.1038/srep34044 |
work_keys_str_mv | AT sunjun crhunterintegratingmultifacetedinformationtopredictcatalyticresiduesinenzymes AT wangjia crhunterintegratingmultifacetedinformationtopredictcatalyticresiduesinenzymes AT xiongdan crhunterintegratingmultifacetedinformationtopredictcatalyticresiduesinenzymes AT hujian crhunterintegratingmultifacetedinformationtopredictcatalyticresiduesinenzymes AT liurong crhunterintegratingmultifacetedinformationtopredictcatalyticresiduesinenzymes |