Cargando…

Automatic prediction of catalytic residues by modeling residue structural neighborhood

BACKGROUND: Prediction of catalytic residues is a major step in characterizing the function of enzymes. In its simpler formulation, the problem can be cast into a binary classification task at the residue level, by predicting whether the residue is directly involved in the catalytic process. The tas...

Descripción completa

Detalles Bibliográficos
Autores principales: Cilia, Elisa, Passerini, Andrea
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2844391/
https://www.ncbi.nlm.nih.gov/pubmed/20199672
http://dx.doi.org/10.1186/1471-2105-11-115
_version_ 1782179300761403392
author Cilia, Elisa
Passerini, Andrea
author_facet Cilia, Elisa
Passerini, Andrea
author_sort Cilia, Elisa
collection PubMed
description BACKGROUND: Prediction of catalytic residues is a major step in characterizing the function of enzymes. In its simpler formulation, the problem can be cast into a binary classification task at the residue level, by predicting whether the residue is directly involved in the catalytic process. The task is quite hard also when structural information is available, due to the rather wide range of roles a functional residue can play and to the large imbalance between the number of catalytic and non-catalytic residues. RESULTS: We developed an effective representation of structural information by modeling spherical regions around candidate residues, and extracting statistics on the properties of their content such as physico-chemical properties, atomic density, flexibility, presence of water molecules. We trained an SVM classifier combining our features with sequence-based information and previously developed 3D features, and compared its performance with the most recent state-of-the-art approaches on different benchmark datasets. We further analyzed the discriminant power of the information provided by the presence of heterogens in the residue neighborhood. CONCLUSIONS: Our structure-based method achieves consistent improvements on all tested datasets over both sequence-based and structure-based state-of-the-art approaches. Structural neighborhood information is shown to be responsible for such results, and predicting the presence of nearby heterogens seems to be a promising direction for further improvements.
format Text
id pubmed-2844391
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28443912010-03-24 Automatic prediction of catalytic residues by modeling residue structural neighborhood Cilia, Elisa Passerini, Andrea BMC Bioinformatics Research article BACKGROUND: Prediction of catalytic residues is a major step in characterizing the function of enzymes. In its simpler formulation, the problem can be cast into a binary classification task at the residue level, by predicting whether the residue is directly involved in the catalytic process. The task is quite hard also when structural information is available, due to the rather wide range of roles a functional residue can play and to the large imbalance between the number of catalytic and non-catalytic residues. RESULTS: We developed an effective representation of structural information by modeling spherical regions around candidate residues, and extracting statistics on the properties of their content such as physico-chemical properties, atomic density, flexibility, presence of water molecules. We trained an SVM classifier combining our features with sequence-based information and previously developed 3D features, and compared its performance with the most recent state-of-the-art approaches on different benchmark datasets. We further analyzed the discriminant power of the information provided by the presence of heterogens in the residue neighborhood. CONCLUSIONS: Our structure-based method achieves consistent improvements on all tested datasets over both sequence-based and structure-based state-of-the-art approaches. Structural neighborhood information is shown to be responsible for such results, and predicting the presence of nearby heterogens seems to be a promising direction for further improvements. BioMed Central 2010-03-03 /pmc/articles/PMC2844391/ /pubmed/20199672 http://dx.doi.org/10.1186/1471-2105-11-115 Text en Copyright ©2010 Cilia and Passerini; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Cilia, Elisa
Passerini, Andrea
Automatic prediction of catalytic residues by modeling residue structural neighborhood
title Automatic prediction of catalytic residues by modeling residue structural neighborhood
title_full Automatic prediction of catalytic residues by modeling residue structural neighborhood
title_fullStr Automatic prediction of catalytic residues by modeling residue structural neighborhood
title_full_unstemmed Automatic prediction of catalytic residues by modeling residue structural neighborhood
title_short Automatic prediction of catalytic residues by modeling residue structural neighborhood
title_sort automatic prediction of catalytic residues by modeling residue structural neighborhood
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2844391/
https://www.ncbi.nlm.nih.gov/pubmed/20199672
http://dx.doi.org/10.1186/1471-2105-11-115
work_keys_str_mv AT ciliaelisa automaticpredictionofcatalyticresiduesbymodelingresiduestructuralneighborhood
AT passeriniandrea automaticpredictionofcatalyticresiduesbymodelingresiduestructuralneighborhood