Cargando…

Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures

Predicting DNA-binding residues from a protein three-dimensional structure is a key task of computational structural proteomics. In the present study, based on machine learning technology, we aim to explore a reduced set of weighted average features for improving prediction of DNA-binding residues o...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Yi, Xia, Junfeng, Zhang, Wen, Liu, Juan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3234263/
https://www.ncbi.nlm.nih.gov/pubmed/22174808
http://dx.doi.org/10.1371/journal.pone.0028440
_version_ 1782218496627703808
author Xiong, Yi
Xia, Junfeng
Zhang, Wen
Liu, Juan
author_facet Xiong, Yi
Xia, Junfeng
Zhang, Wen
Liu, Juan
author_sort Xiong, Yi
collection PubMed
description Predicting DNA-binding residues from a protein three-dimensional structure is a key task of computational structural proteomics. In the present study, based on machine learning technology, we aim to explore a reduced set of weighted average features for improving prediction of DNA-binding residues on protein surfaces. Via constructing the spatial environment around a DNA-binding residue, a novel weighting factor is first proposed to quantify the distance-dependent contribution of each neighboring residue in determining the location of a binding residue. Then, a weighted average scheme is introduced to represent the surface patch of the considering residue. Finally, the classifier is trained on the reduced set of these weighted average features, consisting of evolutionary profile, interface propensity, betweenness centrality and solvent surface area of side chain. Experimental results on 5-fold cross validation and independent tests indicate that the new feature set are effective to describe DNA-binding residues and our approach has significantly better performance than two previous methods. Furthermore, a brief case study suggests that the weighted average features are powerful for identifying DNA-binding residues and are promising for further study of protein structure-function relationship. The source code and datasets are available upon request.
format Online
Article
Text
id pubmed-3234263
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32342632011-12-15 Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures Xiong, Yi Xia, Junfeng Zhang, Wen Liu, Juan PLoS One Research Article Predicting DNA-binding residues from a protein three-dimensional structure is a key task of computational structural proteomics. In the present study, based on machine learning technology, we aim to explore a reduced set of weighted average features for improving prediction of DNA-binding residues on protein surfaces. Via constructing the spatial environment around a DNA-binding residue, a novel weighting factor is first proposed to quantify the distance-dependent contribution of each neighboring residue in determining the location of a binding residue. Then, a weighted average scheme is introduced to represent the surface patch of the considering residue. Finally, the classifier is trained on the reduced set of these weighted average features, consisting of evolutionary profile, interface propensity, betweenness centrality and solvent surface area of side chain. Experimental results on 5-fold cross validation and independent tests indicate that the new feature set are effective to describe DNA-binding residues and our approach has significantly better performance than two previous methods. Furthermore, a brief case study suggests that the weighted average features are powerful for identifying DNA-binding residues and are promising for further study of protein structure-function relationship. The source code and datasets are available upon request. Public Library of Science 2011-12-08 /pmc/articles/PMC3234263/ /pubmed/22174808 http://dx.doi.org/10.1371/journal.pone.0028440 Text en Xiong et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Xiong, Yi
Xia, Junfeng
Zhang, Wen
Liu, Juan
Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
title Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
title_full Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
title_fullStr Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
title_full_unstemmed Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
title_short Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
title_sort exploiting a reduced set of weighted average features to improve prediction of dna-binding residues from 3d structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3234263/
https://www.ncbi.nlm.nih.gov/pubmed/22174808
http://dx.doi.org/10.1371/journal.pone.0028440
work_keys_str_mv AT xiongyi exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures
AT xiajunfeng exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures
AT zhangwen exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures
AT liujuan exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures