Cargando…

ResBoost: characterizing and predicting catalytic residues in enzymes

BACKGROUND: Identifying the catalytic residues in enzymes can aid in understanding the molecular basis of an enzyme's function and has significant implications for designing new drugs, identifying genetic disorders, and engineering proteins with novel functions. Since experimentally determining...

Descripción completa

Detalles Bibliográficos
Autores principales: Alterovitz, Ron, Arvey, Aaron, Sankararaman, Sriram, Dallett, Carolina, Freund, Yoav, Sjölander, Kimmen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2713229/
https://www.ncbi.nlm.nih.gov/pubmed/19558703
http://dx.doi.org/10.1186/1471-2105-10-197
_version_ 1782169561194299392
author Alterovitz, Ron
Arvey, Aaron
Sankararaman, Sriram
Dallett, Carolina
Freund, Yoav
Sjölander, Kimmen
author_facet Alterovitz, Ron
Arvey, Aaron
Sankararaman, Sriram
Dallett, Carolina
Freund, Yoav
Sjölander, Kimmen
author_sort Alterovitz, Ron
collection PubMed
description BACKGROUND: Identifying the catalytic residues in enzymes can aid in understanding the molecular basis of an enzyme's function and has significant implications for designing new drugs, identifying genetic disorders, and engineering proteins with novel functions. Since experimentally determining catalytic sites is expensive, better computational methods for identifying catalytic residues are needed. RESULTS: We propose ResBoost, a new computational method to learn characteristics of catalytic residues. The method effectively selects and combines rules of thumb into a simple, easily interpretable logical expression that can be used for prediction. We formally define the rules of thumb that are often used to narrow the list of candidate residues, including residue evolutionary conservation, 3D clustering, solvent accessibility, and hydrophilicity. ResBoost builds on two methods from machine learning, the AdaBoost algorithm and Alternating Decision Trees, and provides precise control over the inherent trade-off between sensitivity and specificity. We evaluated ResBoost using cross-validation on a dataset of 100 enzymes from the hand-curated Catalytic Site Atlas (CSA). CONCLUSION: ResBoost achieved 85% sensitivity for a 9.8% false positive rate and 73% sensitivity for a 5.7% false positive rate. ResBoost reduces the number of false positives by up to 56% compared to the use of evolutionary conservation scoring alone. We also illustrate the ability of ResBoost to identify recently validated catalytic residues not listed in the CSA.
format Text
id pubmed-2713229
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27132292009-07-21 ResBoost: characterizing and predicting catalytic residues in enzymes Alterovitz, Ron Arvey, Aaron Sankararaman, Sriram Dallett, Carolina Freund, Yoav Sjölander, Kimmen BMC Bioinformatics Methodology Article BACKGROUND: Identifying the catalytic residues in enzymes can aid in understanding the molecular basis of an enzyme's function and has significant implications for designing new drugs, identifying genetic disorders, and engineering proteins with novel functions. Since experimentally determining catalytic sites is expensive, better computational methods for identifying catalytic residues are needed. RESULTS: We propose ResBoost, a new computational method to learn characteristics of catalytic residues. The method effectively selects and combines rules of thumb into a simple, easily interpretable logical expression that can be used for prediction. We formally define the rules of thumb that are often used to narrow the list of candidate residues, including residue evolutionary conservation, 3D clustering, solvent accessibility, and hydrophilicity. ResBoost builds on two methods from machine learning, the AdaBoost algorithm and Alternating Decision Trees, and provides precise control over the inherent trade-off between sensitivity and specificity. We evaluated ResBoost using cross-validation on a dataset of 100 enzymes from the hand-curated Catalytic Site Atlas (CSA). CONCLUSION: ResBoost achieved 85% sensitivity for a 9.8% false positive rate and 73% sensitivity for a 5.7% false positive rate. ResBoost reduces the number of false positives by up to 56% compared to the use of evolutionary conservation scoring alone. We also illustrate the ability of ResBoost to identify recently validated catalytic residues not listed in the CSA. BioMed Central 2009-06-27 /pmc/articles/PMC2713229/ /pubmed/19558703 http://dx.doi.org/10.1186/1471-2105-10-197 Text en Copyright © 2009 Alterovitz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Alterovitz, Ron
Arvey, Aaron
Sankararaman, Sriram
Dallett, Carolina
Freund, Yoav
Sjölander, Kimmen
ResBoost: characterizing and predicting catalytic residues in enzymes
title ResBoost: characterizing and predicting catalytic residues in enzymes
title_full ResBoost: characterizing and predicting catalytic residues in enzymes
title_fullStr ResBoost: characterizing and predicting catalytic residues in enzymes
title_full_unstemmed ResBoost: characterizing and predicting catalytic residues in enzymes
title_short ResBoost: characterizing and predicting catalytic residues in enzymes
title_sort resboost: characterizing and predicting catalytic residues in enzymes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2713229/
https://www.ncbi.nlm.nih.gov/pubmed/19558703
http://dx.doi.org/10.1186/1471-2105-10-197
work_keys_str_mv AT alterovitzron resboostcharacterizingandpredictingcatalyticresiduesinenzymes
AT arveyaaron resboostcharacterizingandpredictingcatalyticresiduesinenzymes
AT sankararamansriram resboostcharacterizingandpredictingcatalyticresiduesinenzymes
AT dallettcarolina resboostcharacterizingandpredictingcatalyticresiduesinenzymes
AT freundyoav resboostcharacterizingandpredictingcatalyticresiduesinenzymes
AT sjolanderkimmen resboostcharacterizingandpredictingcatalyticresiduesinenzymes