Cargando…

An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties

BACKGROUND: DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein–DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approac...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Sijia, Wang, Lihua, Zhao, Le, Li, Menglu, Liu, Mengya, Li, Ke, Bin, Yannan, Xia, Junfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8130120/
https://www.ncbi.nlm.nih.gov/pubmed/34000983
http://dx.doi.org/10.1186/s12859-020-03871-1
_version_ 1783694451317669888
author Zhang, Sijia
Wang, Lihua
Zhao, Le
Li, Menglu
Liu, Mengya
Li, Ke
Bin, Yannan
Xia, Junfeng
author_facet Zhang, Sijia
Wang, Lihua
Zhao, Le
Li, Menglu
Liu, Mengya
Li, Ke
Bin, Yannan
Xia, Junfeng
author_sort Zhang, Sijia
collection PubMed
description BACKGROUND: DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein–DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approaches are emerging as alternative pathways to experimental methods. RESULTS: Herein, we present a new computational method, termed inpPDH, for hot spot prediction. To improve the prediction performance, we extract hybrid features which incorporate traditional features and new interfacial neighbor properties. To remove redundant and irrelevant features, feature selection is employed using a two-step feature selection strategy. Finally, a subset of 7 optimal features are chosen to construct the predictor using support vector machine. The results on the benchmark dataset show that this proposed method yields significantly better prediction accuracy than those previously published methods in the literature. Moreover, a user-friendly web server for inpPDH is well established and is freely available at http://bioinfo.ahu.edu.cn/inpPDH. CONCLUSIONS: We have developed an accurate improved prediction model, inpPDH, for hot spot residues in protein–DNA binding interfaces by given the structure of a protein–DNA complex. Moreover, we identify a comprehensive and useful feature subset including the proposed interfacial neighbor features that has an important strength for identifying hot spot residues. Our results indicate that these features are more effective than the conventional features considered previously, and that the combination of interfacial neighbor features and traditional features may support the creation of a discriminative feature set for efficient prediction of hot spot residues in protein–DNA complexes. SUPPLEMENTARY INFORMATION: Supplementary information accompanies this paper at 10.1186/s12859-020-03871-1.
format Online
Article
Text
id pubmed-8130120
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-81301202021-05-18 An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties Zhang, Sijia Wang, Lihua Zhao, Le Li, Menglu Liu, Mengya Li, Ke Bin, Yannan Xia, Junfeng BMC Bioinformatics Research BACKGROUND: DNA-binding hot spots are dominant and fundamental residues that contribute most of the binding free energy yet accounting for a small portion of protein–DNA interfaces. As experimental methods for identifying hot spots are time-consuming and costly, high-efficiency computational approaches are emerging as alternative pathways to experimental methods. RESULTS: Herein, we present a new computational method, termed inpPDH, for hot spot prediction. To improve the prediction performance, we extract hybrid features which incorporate traditional features and new interfacial neighbor properties. To remove redundant and irrelevant features, feature selection is employed using a two-step feature selection strategy. Finally, a subset of 7 optimal features are chosen to construct the predictor using support vector machine. The results on the benchmark dataset show that this proposed method yields significantly better prediction accuracy than those previously published methods in the literature. Moreover, a user-friendly web server for inpPDH is well established and is freely available at http://bioinfo.ahu.edu.cn/inpPDH. CONCLUSIONS: We have developed an accurate improved prediction model, inpPDH, for hot spot residues in protein–DNA binding interfaces by given the structure of a protein–DNA complex. Moreover, we identify a comprehensive and useful feature subset including the proposed interfacial neighbor features that has an important strength for identifying hot spot residues. Our results indicate that these features are more effective than the conventional features considered previously, and that the combination of interfacial neighbor features and traditional features may support the creation of a discriminative feature set for efficient prediction of hot spot residues in protein–DNA complexes. SUPPLEMENTARY INFORMATION: Supplementary information accompanies this paper at 10.1186/s12859-020-03871-1. BioMed Central 2021-05-17 /pmc/articles/PMC8130120/ /pubmed/34000983 http://dx.doi.org/10.1186/s12859-020-03871-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhang, Sijia
Wang, Lihua
Zhao, Le
Li, Menglu
Liu, Mengya
Li, Ke
Bin, Yannan
Xia, Junfeng
An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
title An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
title_full An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
title_fullStr An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
title_full_unstemmed An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
title_short An improved DNA-binding hot spot residues prediction method by exploring interfacial neighbor properties
title_sort improved dna-binding hot spot residues prediction method by exploring interfacial neighbor properties
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8130120/
https://www.ncbi.nlm.nih.gov/pubmed/34000983
http://dx.doi.org/10.1186/s12859-020-03871-1
work_keys_str_mv AT zhangsijia animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT wanglihua animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT zhaole animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT limenglu animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT liumengya animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT like animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT binyannan animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT xiajunfeng animproveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT zhangsijia improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT wanglihua improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT zhaole improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT limenglu improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT liumengya improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT like improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT binyannan improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties
AT xiajunfeng improveddnabindinghotspotresiduespredictionmethodbyexploringinterfacialneighborproperties