Cargando…

DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues

Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many R...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Jing, Kurgan, Lukasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449545/
https://www.ncbi.nlm.nih.gov/pubmed/28132027
http://dx.doi.org/10.1093/nar/gkx059
_version_ 1783239800575229952
author Yan, Jing
Kurgan, Lukasz
author_facet Yan, Jing
Kurgan, Lukasz
author_sort Yan, Jing
collection PubMed
description Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many RNA-binding residues as DNA-binding, and vice versa. Most of these methods are also relatively slow, prohibiting applications on the whole-genome scale. We describe a novel sequence-based method, DRNApred, which accurately and in high-throughput predicts and discriminates between DNA- and RNA-binding residues. DRNApred was designed using a new dataset with both DNA- and RNA-binding proteins, regression that penalizes cross-predictions, and a novel two-layered architecture. DRNApred outperforms state-of-the-art predictors of DNA- or RNA-binding residues on a benchmark test dataset by substantially reducing the cross predictions and predicting arguably higher quality false positives that are located nearby the native binding residues. Moreover, it also more accurately predicts the DNA- and RNA-binding proteins. Application on the human proteome confirms that DRNApred reduces the cross predictions among the native nucleic acid binders. Also, novel putative DNA/RNA-binding proteins that it predicts share similar subcellular locations and residue charge profiles with the known native binding proteins. Webserver of DRNApred is freely available at http://biomine.cs.vcu.edu/servers/DRNApred/.
format Online
Article
Text
id pubmed-5449545
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54495452017-06-05 DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues Yan, Jing Kurgan, Lukasz Nucleic Acids Res Methods Online Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many RNA-binding residues as DNA-binding, and vice versa. Most of these methods are also relatively slow, prohibiting applications on the whole-genome scale. We describe a novel sequence-based method, DRNApred, which accurately and in high-throughput predicts and discriminates between DNA- and RNA-binding residues. DRNApred was designed using a new dataset with both DNA- and RNA-binding proteins, regression that penalizes cross-predictions, and a novel two-layered architecture. DRNApred outperforms state-of-the-art predictors of DNA- or RNA-binding residues on a benchmark test dataset by substantially reducing the cross predictions and predicting arguably higher quality false positives that are located nearby the native binding residues. Moreover, it also more accurately predicts the DNA- and RNA-binding proteins. Application on the human proteome confirms that DRNApred reduces the cross predictions among the native nucleic acid binders. Also, novel putative DNA/RNA-binding proteins that it predicts share similar subcellular locations and residue charge profiles with the known native binding proteins. Webserver of DRNApred is freely available at http://biomine.cs.vcu.edu/servers/DRNApred/. Oxford University Press 2017-06-02 2017-01-28 /pmc/articles/PMC5449545/ /pubmed/28132027 http://dx.doi.org/10.1093/nar/gkx059 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Yan, Jing
Kurgan, Lukasz
DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
title DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
title_full DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
title_fullStr DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
title_full_unstemmed DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
title_short DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
title_sort drnapred, fast sequence-based method that accurately predicts and discriminates dna- and rna-binding residues
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449545/
https://www.ncbi.nlm.nih.gov/pubmed/28132027
http://dx.doi.org/10.1093/nar/gkx059
work_keys_str_mv AT yanjing drnapredfastsequencebasedmethodthataccuratelypredictsanddiscriminatesdnaandrnabindingresidues
AT kurganlukasz drnapredfastsequencebasedmethodthataccuratelypredictsanddiscriminatesdnaandrnabindingresidues