Cargando…
An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
BACKGROUND: DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein–DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approac...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8130120/ https://www.ncbi.nlm.nih.gov/pubmed/34000983 http://dx.doi.org/10.1186/s12859-020-03871-1 |
_version_ | 1783694451317669888 |
---|---|
author | Zhang, Sijia Wang, Lihua Zhao, Le Li, Menglu Liu, Mengya Li, Ke Bin, Yannan Xia, Junfeng |
author_facet | Zhang, Sijia Wang, Lihua Zhao, Le Li, Menglu Liu, Mengya Li, Ke Bin, Yannan Xia, Junfeng |
author_sort | Zhang, Sijia |
collection | PubMed |
description | BACKGROUND: DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein–DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approaches are emerging as alternative pathways to experimental methods. RESULTS: Herein, we present a new computational method, termed inpPDH, for hot spot prediction. To improve the prediction performance, we extract hybrid features which incorporate traditional features and new interfacial neighbor properties. To remove redundant and irrelevant features, feature selection is employed using a two-step feature selection strategy. Finally, a subset of 7 optimal features are chosen to construct the predictor using support vector machine. The results on the benchmark dataset show that this proposed method yields significantly better prediction accuracy than those previously published methods in the literature. Moreover, a user-friendly web server for inpPDH is well established and is freely available at http://bioinfo.ahu.edu.cn/inpPDH. CONCLUSIONS: We have developed an accurate improved prediction model, inpPDH, for hot spot residues in protein–DNA binding interfaces by given the structure of a protein–DNA complex. Moreover, we identify a comprehensive and useful feature subset including the proposed interfacial neighbor features that has an important strength for identifying hot spot residues. Our results indicate that these features are more effective than the conventional features considered previously, and that the combination of interfacial neighbor features and traditional features may support the creation of a discriminative feature set for efficient prediction of hot spot residues in protein–DNA complexes. SUPPLEMENTARY INFORMATION: Supplementary information accompanies this paper at 10.1186/s12859-020-03871-1. |
format | Online Article Text |
id | pubmed-8130120 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81301202021-05-18 An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties Zhang, Sijia Wang, Lihua Zhao, Le Li, Menglu Liu, Mengya Li, Ke Bin, Yannan Xia, Junfeng BMC Bioinformatics Research BACKGROUND: DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein–DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approaches are emerging as alternative pathways to experimental methods. RESULTS: Herein, we present a new computational method, termed inpPDH, for hot spot prediction. To improve the prediction performance, we extract hybrid features which incorporate traditional features and new interfacial neighbor properties. To remove redundant and irrelevant features, feature selection is employed using a two-step feature selection strategy. Finally, a subset of 7 optimal features are chosen to construct the predictor using support vector machine. The results on the benchmark dataset show that this proposed method yields significantly better prediction accuracy than those previously published methods in the literature. Moreover, a user-friendly web server for inpPDH is well established and is freely available at http://bioinfo.ahu.edu.cn/inpPDH. CONCLUSIONS: We have developed an accurate improved prediction model, inpPDH, for hot spot residues in protein–DNA binding interfaces by given the structure of a protein–DNA complex. Moreover, we identify a comprehensive and useful feature subset including the proposed interfacial neighbor features that has an important strength for identifying hot spot residues. Our results indicate that these features are more effective than the conventional features considered previously, and that the combination of interfacial neighbor features and traditional features may support the creation of a discriminative feature set for efficient prediction of hot spot residues in protein–DNA complexes. SUPPLEMENTARY INFORMATION: Supplementary information accompanies this paper at 10.1186/s12859-020-03871-1. BioMed Central 2021-05-17 /pmc/articles/PMC8130120/ /pubmed/34000983 http://dx.doi.org/10.1186/s12859-020-03871-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhang, Sijia Wang, Lihua Zhao, Le Li, Menglu Liu, Mengya Li, Ke Bin, Yannan Xia, Junfeng An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties |
title | An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties |
title_full | An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties |
title_fullStr | An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties |
title_full_unstemmed | An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties |
title_short | An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties |
title_sort | improved dna-binding hot spot residues prediction method by exploring interfacial neighbor properties |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8130120/ https://www.ncbi.nlm.nih.gov/pubmed/34000983 http://dx.doi.org/10.1186/s12859-020-03871-1 |
work_keys_str_mv | AT zhangsijia animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT wanglihua animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT zhaole animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT limenglu animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT liumengya animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT like animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT binyannan animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT xiajunfeng animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT zhangsijia improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT wanglihua improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT zhaole improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT limenglu improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT liumengya improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT like improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT binyannan improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties AT xiajunfeng improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties |