Cargando…

DBSI: DNA-binding site identifier

In this study, we present the DNA-Binding Site Identifier (DBSI), a new structure-based method for predicting protein interaction sites for DNA binding. DBSI was trained and validated on a data set of 263 proteins (TRAIN-263), tested on an independent set of protein-DNA complexes (TEST-206) and data...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Xiaolei, Ericksen, Spencer S., Mitchell, Julie C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3763564/
https://www.ncbi.nlm.nih.gov/pubmed/23873960
http://dx.doi.org/10.1093/nar/gkt617
_version_ 1782283036557049856
author Zhu, Xiaolei
Ericksen, Spencer S.
Mitchell, Julie C.
author_facet Zhu, Xiaolei
Ericksen, Spencer S.
Mitchell, Julie C.
author_sort Zhu, Xiaolei
collection PubMed
description In this study, we present the DNA-Binding Site Identifier (DBSI), a new structure-based method for predicting protein interaction sites for DNA binding. DBSI was trained and validated on a data set of 263 proteins (TRAIN-263), tested on an independent set of protein-DNA complexes (TEST-206) and data sets of 29 unbound (APO-29) and 30 bound (HOLO-30) protein structures distinct from the training data. We computed 480 candidate features for identifying protein residues that bind DNA, including new features that capture the electrostatic microenvironment within shells near the protein surface. Our iterative feature selection process identified features important in other models, as well as features unique to the DBSI model, such as a banded electrostatic feature with spatial separation comparable with the canonical width of the DNA minor groove. Validations and comparisons with established methods using a range of performance metrics clearly demonstrate the predictive advantage of DBSI, and its comparable performance on unbound (APO-29) and bound (HOLO-30) conformations demonstrates robustness to binding-induced protein conformational changes. Finally, we offer our feature data table to others for integration into their own models or for testing improved feature selection and model training strategies based on DBSI.
format Online
Article
Text
id pubmed-3763564
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37635642013-09-10 DBSI: DNA-binding site identifier Zhu, Xiaolei Ericksen, Spencer S. Mitchell, Julie C. Nucleic Acids Res Methods Online In this study, we present the DNA-Binding Site Identifier (DBSI), a new structure-based method for predicting protein interaction sites for DNA binding. DBSI was trained and validated on a data set of 263 proteins (TRAIN-263), tested on an independent set of protein-DNA complexes (TEST-206) and data sets of 29 unbound (APO-29) and 30 bound (HOLO-30) protein structures distinct from the training data. We computed 480 candidate features for identifying protein residues that bind DNA, including new features that capture the electrostatic microenvironment within shells near the protein surface. Our iterative feature selection process identified features important in other models, as well as features unique to the DBSI model, such as a banded electrostatic feature with spatial separation comparable with the canonical width of the DNA minor groove. Validations and comparisons with established methods using a range of performance metrics clearly demonstrate the predictive advantage of DBSI, and its comparable performance on unbound (APO-29) and bound (HOLO-30) conformations demonstrates robustness to binding-induced protein conformational changes. Finally, we offer our feature data table to others for integration into their own models or for testing improved feature selection and model training strategies based on DBSI. Oxford University Press 2013-09 2013-07-19 /pmc/articles/PMC3763564/ /pubmed/23873960 http://dx.doi.org/10.1093/nar/gkt617 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Zhu, Xiaolei
Ericksen, Spencer S.
Mitchell, Julie C.
DBSI: DNA-binding site identifier
title DBSI: DNA-binding site identifier
title_full DBSI: DNA-binding site identifier
title_fullStr DBSI: DNA-binding site identifier
title_full_unstemmed DBSI: DNA-binding site identifier
title_short DBSI: DNA-binding site identifier
title_sort dbsi: dna-binding site identifier
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3763564/
https://www.ncbi.nlm.nih.gov/pubmed/23873960
http://dx.doi.org/10.1093/nar/gkt617
work_keys_str_mv AT zhuxiaolei dbsidnabindingsiteidentifier
AT ericksenspencers dbsidnabindingsiteidentifier
AT mitchelljuliec dbsidnabindingsiteidentifier