Cargando…
DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues
Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many R...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449545/ https://www.ncbi.nlm.nih.gov/pubmed/28132027 http://dx.doi.org/10.1093/nar/gkx059 |
_version_ | 1783239800575229952 |
---|---|
author | Yan, Jing Kurgan, Lukasz |
author_facet | Yan, Jing Kurgan, Lukasz |
author_sort | Yan, Jing |
collection | PubMed |
description | Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many RNA-binding residues as DNA-binding, and vice versa. Most of these methods are also relatively slow, prohibiting applications on the whole-genome scale. We describe a novel sequence-based method, DRNApred, which accurately and in high-throughput predicts and discriminates between DNA- and RNA-binding residues. DRNApred was designed using a new dataset with both DNA- and RNA-binding proteins, regression that penalizes cross-predictions, and a novel two-layered architecture. DRNApred outperforms state-of-the-art predictors of DNA- or RNA-binding residues on a benchmark test dataset by substantially reducing the cross predictions and predicting arguably higher quality false positives that are located nearby the native binding residues. Moreover, it also more accurately predicts the DNA- and RNA-binding proteins. Application on the human proteome confirms that DRNApred reduces the cross predictions among the native nucleic acid binders. Also, novel putative DNA/RNA-binding proteins that it predicts share similar subcellular locations and residue charge profiles with the known native binding proteins. Webserver of DRNApred is freely available at http://biomine.cs.vcu.edu/servers/DRNApred/. |
format | Online Article Text |
id | pubmed-5449545 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54495452017-06-05 DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues Yan, Jing Kurgan, Lukasz Nucleic Acids Res Methods Online Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many RNA-binding residues as DNA-binding, and vice versa. Most of these methods are also relatively slow, prohibiting applications on the whole-genome scale. We describe a novel sequence-based method, DRNApred, which accurately and in high-throughput predicts and discriminates between DNA- and RNA-binding residues. DRNApred was designed using a new dataset with both DNA- and RNA-binding proteins, regression that penalizes cross-predictions, and a novel two-layered architecture. DRNApred outperforms state-of-the-art predictors of DNA- or RNA-binding residues on a benchmark test dataset by substantially reducing the cross predictions and predicting arguably higher quality false positives that are located nearby the native binding residues. Moreover, it also more accurately predicts the DNA- and RNA-binding proteins. Application on the human proteome confirms that DRNApred reduces the cross predictions among the native nucleic acid binders. Also, novel putative DNA/RNA-binding proteins that it predicts share similar subcellular locations and residue charge profiles with the known native binding proteins. Webserver of DRNApred is freely available at http://biomine.cs.vcu.edu/servers/DRNApred/. Oxford University Press 2017-06-02 2017-01-28 /pmc/articles/PMC5449545/ /pubmed/28132027 http://dx.doi.org/10.1093/nar/gkx059 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Yan, Jing Kurgan, Lukasz DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues |
title | DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues |
title_full | DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues |
title_fullStr | DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues |
title_full_unstemmed | DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues |
title_short | DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues |
title_sort | drnapred, fast sequence-based method that accurately predicts and discriminates dna- and rna-binding residues |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5449545/ https://www.ncbi.nlm.nih.gov/pubmed/28132027 http://dx.doi.org/10.1093/nar/gkx059 |
work_keys_str_mv | AT yanjing drnapredfastsequencebasedmethodthataccuratelypredictsanddiscriminatesdnaandrnabindingresidues AT kurganlukasz drnapredfastsequencebasedmethodthataccuratelypredictsanddiscriminatesdnaandrnabindingresidues |