Cargando…
Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures
Predicting DNA-binding residues from a protein three-dimensional structure is a key task of computational structural proteomics. In the present study, based on machine learning technology, we aim to explore a reduced set of weighted average features for improving prediction of DNA-binding residues o...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3234263/ https://www.ncbi.nlm.nih.gov/pubmed/22174808 http://dx.doi.org/10.1371/journal.pone.0028440 |
_version_ | 1782218496627703808 |
---|---|
author | Xiong, Yi Xia, Junfeng Zhang, Wen Liu, Juan |
author_facet | Xiong, Yi Xia, Junfeng Zhang, Wen Liu, Juan |
author_sort | Xiong, Yi |
collection | PubMed |
description | Predicting DNA-binding residues from a protein three-dimensional structure is a key task of computational structural proteomics. In the present study, based on machine learning technology, we aim to explore a reduced set of weighted average features for improving prediction of DNA-binding residues on protein surfaces. Via constructing the spatial environment around a DNA-binding residue, a novel weighting factor is first proposed to quantify the distance-dependent contribution of each neighboring residue in determining the location of a binding residue. Then, a weighted average scheme is introduced to represent the surface patch of the considering residue. Finally, the classifier is trained on the reduced set of these weighted average features, consisting of evolutionary profile, interface propensity, betweenness centrality and solvent surface area of side chain. Experimental results on 5-fold cross validation and independent tests indicate that the new feature set are effective to describe DNA-binding residues and our approach has significantly better performance than two previous methods. Furthermore, a brief case study suggests that the weighted average features are powerful for identifying DNA-binding residues and are promising for further study of protein structure-function relationship. The source code and datasets are available upon request. |
format | Online Article Text |
id | pubmed-3234263 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-32342632011-12-15 Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures Xiong, Yi Xia, Junfeng Zhang, Wen Liu, Juan PLoS One Research Article Predicting DNA-binding residues from a protein three-dimensional structure is a key task of computational structural proteomics. In the present study, based on machine learning technology, we aim to explore a reduced set of weighted average features for improving prediction of DNA-binding residues on protein surfaces. Via constructing the spatial environment around a DNA-binding residue, a novel weighting factor is first proposed to quantify the distance-dependent contribution of each neighboring residue in determining the location of a binding residue. Then, a weighted average scheme is introduced to represent the surface patch of the considering residue. Finally, the classifier is trained on the reduced set of these weighted average features, consisting of evolutionary profile, interface propensity, betweenness centrality and solvent surface area of side chain. Experimental results on 5-fold cross validation and independent tests indicate that the new feature set are effective to describe DNA-binding residues and our approach has significantly better performance than two previous methods. Furthermore, a brief case study suggests that the weighted average features are powerful for identifying DNA-binding residues and are promising for further study of protein structure-function relationship. The source code and datasets are available upon request. Public Library of Science 2011-12-08 /pmc/articles/PMC3234263/ /pubmed/22174808 http://dx.doi.org/10.1371/journal.pone.0028440 Text en Xiong et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Xiong, Yi Xia, Junfeng Zhang, Wen Liu, Juan Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures |
title | Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures |
title_full | Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures |
title_fullStr | Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures |
title_full_unstemmed | Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures |
title_short | Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D Structures |
title_sort | exploiting a reduced set of weighted average features to improve prediction of dna-binding residues from 3d structures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3234263/ https://www.ncbi.nlm.nih.gov/pubmed/22174808 http://dx.doi.org/10.1371/journal.pone.0028440 |
work_keys_str_mv | AT xiongyi exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures AT xiajunfeng exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures AT zhangwen exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures AT liujuan exploitingareducedsetofweightedaveragefeaturestoimprovepredictionofdnabindingresiduesfrom3dstructures |