
L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier

Identifying catalytic residues is usually a first step toward understanding enzyme function. Knowledge of catalytic residues is also useful for protein engineering and drug design. However, identifying catalytic residues experimentally remains challenging because of the time and cost involved. Ther...


Bibliographic Details
Main Authors:	Dou, Yongchao, Wang, Jun, Yang, Jialiang, Zhang, Chi
Format:	Online Article Text
Language:	English
Published:	Public Library of Science 2012
Subjects:	Research Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3338704/ https://www.ncbi.nlm.nih.gov/pubmed/22558194 http://dx.doi.org/10.1371/journal.pone.0035666

_version_	1782231245038551040
author	Dou, Yongchao Wang, Jun Yang, Jialiang Zhang, Chi
author_facet	Dou, Yongchao Wang, Jun Yang, Jialiang Zhang, Chi
author_sort	Dou, Yongchao
collection	PubMed
description	Identifying catalytic residues is usually a first step toward understanding enzyme function. Knowledge of catalytic residues is also useful for protein engineering and drug design. However, identifying catalytic residues experimentally remains challenging because of the time and cost involved. Therefore, computational methods have been explored to predict catalytic residues. Here, we developed a new algorithm, L1pred, for catalytic residue prediction, which uses the L1-logreg classifier to integrate eight sequence-based scoring functions. We tested L1pred and compared it against several existing sequence-based methods on two carefully designed datasets, Data604 and Data63. With ten-fold cross-validation on the training dataset, Data604, L1pred achieved an area under the precision-recall curve (AUPR) of 0.2198 and an area under the ROC curve (AUC) of 0.9494. On the independent test dataset, Data63, it achieved an AUPR of 0.2636 and an AUC of 0.9375. Compared with other sequence-based methods, L1pred showed the best performance on both datasets. We also analyzed the importance of each attribute in the algorithm and found that all the scores contributed more or less equally to the L1pred performance.
format	Online Article Text
id	pubmed-3338704
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-33387042012-05-03 L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier Dou, Yongchao Wang, Jun Yang, Jialiang Zhang, Chi PLoS One Research Article To understand enzyme functions, identifying the catalytic residues is a usual first step. Moreover, knowledge about catalytic residues is also useful for protein engineering and drug-design. However, to experimentally identify catalytic residues remains challenging for reasons of time and cost. Therefore, computational methods have been explored to predict catalytic residues. Here, we developed a new algorithm, L1pred, for catalytic residue prediction, by using the L1-logreg classifier to integrate eight sequence-based scoring functions. We tested L1pred and compared it against several existing sequence-based methods on carefully designed datasets Data604 and Data63. With ten-fold cross-validation, L1pred showed the area under precision-recall curve (AUPR) and the area under ROC curve (AUC) of 0.2198 and 0.9494 on the training dataset, Data604, respectively. In addition, on the independent test dataset, Data63, it showed the AUPR and AUC values of 0.2636 and 0.9375, respectively. Compared with other sequence-based methods, L1pred showed the best performance on both datasets. We also analyzed the importance of each attribute in the algorithm, and found that all the scores contributed more or less equally to the L1pred performance. Public Library of Science 2012-04-27 /pmc/articles/PMC3338704/ /pubmed/22558194 http://dx.doi.org/10.1371/journal.pone.0035666 Text en Dou et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Dou, Yongchao Wang, Jun Yang, Jialiang Zhang, Chi L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier
title	L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3338704/ https://www.ncbi.nlm.nih.gov/pubmed/22558194 http://dx.doi.org/10.1371/journal.pone.0035666


Similar Items