Cargando…
Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information
DNA–protein interactions appear as pivotal roles in diverse biological procedures and are paramount for cell metabolism, while identifying them with computational means is a kind of prudent scenario in depleting in vitro and in vivo experimental charging. A variety of state-of-the-art investigations...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6149935/ https://www.ncbi.nlm.nih.gov/pubmed/29182548 http://dx.doi.org/10.3390/molecules22122079 |
_version_ | 1783356900625088512 |
---|---|
author | Shen, Cong Ding, Yijie Tang, Jijun Song, Jian Guo, Fei |
author_facet | Shen, Cong Ding, Yijie Tang, Jijun Song, Jian Guo, Fei |
author_sort | Shen, Cong |
collection | PubMed |
description | DNA–protein interactions appear as pivotal roles in diverse biological procedures and are paramount for cell metabolism, while identifying them with computational means is a kind of prudent scenario in depleting in vitro and in vivo experimental charging. A variety of state-of-the-art investigations have been elucidated to improve the accuracy of the DNA–protein binding sites prediction. Nevertheless, structure-based approaches are limited under the condition without 3D information, and the predictive validity is still refinable. In this essay, we address a kind of competitive method called Multi-scale Local Average Blocks (MLAB) algorithm to solve this issue. Different from structure-based routes, MLAB exploits a strategy that not only extracts local evolutionary information from primary sequences, but also using predicts solvent accessibility. Moreover, the construction about predictors of DNA–protein binding sites wields an ensemble weighted sparse representation model with random under-sampling. To evaluate the performance of MLAB, we conduct comprehensive experiments of DNA–protein binding sites prediction. MLAB gives [Formula: see text] of [Formula: see text] , [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41, PDNA-316 and PDNA-52 datasets, respectively. It shows that MLAB gains advantages by comparing with other outstanding methods. [Formula: see text] for our method is increased by at least [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41 and PDNA-316 datasets, respectively. |
format | Online Article Text |
id | pubmed-6149935 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-61499352018-11-13 Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information Shen, Cong Ding, Yijie Tang, Jijun Song, Jian Guo, Fei Molecules Article DNA–protein interactions appear as pivotal roles in diverse biological procedures and are paramount for cell metabolism, while identifying them with computational means is a kind of prudent scenario in depleting in vitro and in vivo experimental charging. A variety of state-of-the-art investigations have been elucidated to improve the accuracy of the DNA–protein binding sites prediction. Nevertheless, structure-based approaches are limited under the condition without 3D information, and the predictive validity is still refinable. In this essay, we address a kind of competitive method called Multi-scale Local Average Blocks (MLAB) algorithm to solve this issue. Different from structure-based routes, MLAB exploits a strategy that not only extracts local evolutionary information from primary sequences, but also using predicts solvent accessibility. Moreover, the construction about predictors of DNA–protein binding sites wields an ensemble weighted sparse representation model with random under-sampling. To evaluate the performance of MLAB, we conduct comprehensive experiments of DNA–protein binding sites prediction. MLAB gives [Formula: see text] of [Formula: see text] , [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41, PDNA-316 and PDNA-52 datasets, respectively. It shows that MLAB gains advantages by comparing with other outstanding methods. [Formula: see text] for our method is increased by at least [Formula: see text] , [Formula: see text] and [Formula: see text] on PDNA-543, PDNA-41 and PDNA-316 datasets, respectively. MDPI 2017-11-28 /pmc/articles/PMC6149935/ /pubmed/29182548 http://dx.doi.org/10.3390/molecules22122079 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Shen, Cong Ding, Yijie Tang, Jijun Song, Jian Guo, Fei Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information |
title | Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information |
title_full | Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information |
title_fullStr | Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information |
title_full_unstemmed | Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information |
title_short | Identification of DNA–protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information |
title_sort | identification of dna–protein binding sites through multi-scale local average blocks on sequence information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6149935/ https://www.ncbi.nlm.nih.gov/pubmed/29182548 http://dx.doi.org/10.3390/molecules22122079 |
work_keys_str_mv | AT shencong identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation AT dingyijie identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation AT tangjijun identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation AT songjian identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation AT guofei identificationofdnaproteinbindingsitesthroughmultiscalelocalaverageblocksonsequenceinformation |