Cargando…

Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information

DNA–protein interactions appear as pivotal roles in diverse biological procedures and are paramount for cell metabolism, while identifying them with computational means is a kind of prudent scenario in depleting in vitro and in vivo experimental charging. A variety of state-of-the-art investigations...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Cong, Ding, Yijie, Tang, Jijun, Song, Jian, Guo, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6149935/
https://www.ncbi.nlm.nih.gov/pubmed/29182548
http://dx.doi.org/10.3390/molecules22122079
_version_ 1783356900625088512
author Shen, Cong
Ding, Yijie
Tang, Jijun
Song, Jian
Guo, Fei
author_facet Shen, Cong
Ding, Yijie
Tang, Jijun
Song, Jian
Guo, Fei
author_sort Shen, Cong
collection PubMed
description DNA–protein interactions appear as pivotal roles in diverse biological procedures and are paramount for cell metabolism, while identifying them with computational means is a kind of prudent scenario in depleting in vitro and in vivo experimental charging. A variety of state-of-the-art investigations have been elucidated to improve the accuracy of the DNA–protein binding sites prediction. Nevertheless, structure-based approaches are limited under the condition without 3D information, and the predictive validity is still refinable. In this essay, we address a kind of competitive method called Multi-scale Local Average Blocks (MLAB) algorithm to solve this issue. Different from structure-based routes, MLAB exploits a strategy that not only extracts local evolutionary information from primary sequences, but also using predicts solvent accessibility. Moreover, the construction about predictors of DNA–protein binding sites wields an ensemble weighted sparse representation model with random under-sampling. To evaluate the performance of MLAB, we conduct comprehensive experiments of DNA–protein binding sites prediction. MLAB gives [Formula: see text] of [Formula: see text] , [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41, PDNA-316 and PDNA-52 datasets, respectively. It shows that MLAB gains advantages by comparing with other outstanding methods. [Formula: see text] for our method is increased by at least [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41 and PDNA-316 datasets, respectively.
format Online
Article
Text
id pubmed-6149935
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-61499352018-11-13 Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information Shen, Cong Ding, Yijie Tang, Jijun Song, Jian Guo, Fei Molecules Article DNA–protein interactions appear as pivotal roles in diverse biological procedures and are paramount for cell metabolism, while identifying them with computational means is a kind of prudent scenario in depleting in vitro and in vivo experimental charging. A variety of state-of-the-art investigations have been elucidated to improve the accuracy of the DNA–protein binding sites prediction. Nevertheless, structure-based approaches are limited under the condition without 3D information, and the predictive validity is still refinable. In this essay, we address a kind of competitive method called Multi-scale Local Average Blocks (MLAB) algorithm to solve this issue. Different from structure-based routes, MLAB exploits a strategy that not only extracts local evolutionary information from primary sequences, but also using predicts solvent accessibility. Moreover, the construction about predictors of DNA–protein binding sites wields an ensemble weighted sparse representation model with random under-sampling. To evaluate the performance of MLAB, we conduct comprehensive experiments of DNA–protein binding sites prediction. MLAB gives [Formula: see text] of [Formula: see text] , [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41, PDNA-316 and PDNA-52 datasets, respectively. It shows that MLAB gains advantages by comparing with other outstanding methods. [Formula: see text] for our method is increased by at least [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41 and PDNA-316 datasets, respectively. MDPI 2017-11-28 /pmc/articles/PMC6149935/ /pubmed/29182548 http://dx.doi.org/10.3390/molecules22122079 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shen, Cong
Ding, Yijie
Tang, Jijun
Song, Jian
Guo, Fei
Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
title Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
title_full Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
title_fullStr Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
title_full_unstemmed Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
title_short Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
title_sort identification of dna–protein binding sites through multi-scale local average blocks on sequence information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6149935/
https://www.ncbi.nlm.nih.gov/pubmed/29182548
http://dx.doi.org/10.3390/molecules22122079
work_keys_str_mv AT shencong identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation
AT dingyijie identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation
AT tangjijun identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation
AT songjian identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation
AT guofei identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation