Cargando…
RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data
BACKGROUND: RNA-editing is an important post-transcriptional RNA sequence modification performed by two catalytic enzymes, "ADAR"(A-to-I) and "APOBEC"(C-to-U). By utilizing high-throughput sequencing technologies, the biological function of RNA-editing has been actively investiga...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895604/ https://www.ncbi.nlm.nih.gov/pubmed/26817607 http://dx.doi.org/10.1186/s12864-015-2301-y |
_version_ | 1782435884037046272 |
---|---|
author | Kim, Min-su Hur, Benjamin Kim, Sun |
author_facet | Kim, Min-su Hur, Benjamin Kim, Sun |
author_sort | Kim, Min-su |
collection | PubMed |
description | BACKGROUND: RNA-editing is an important post-transcriptional RNA sequence modification performed by two catalytic enzymes, "ADAR"(A-to-I) and "APOBEC"(C-to-U). By utilizing high-throughput sequencing technologies, the biological function of RNA-editing has been actively investigated. Currently, RNA-editing is considered to be a key regulator that controls various cellular functions, such as protein activity, alternative splicing pattern of mRNA, and substitution of miRNA targeting site. DARNED, a public RDD database, reported that there are more than 300-thousands RNA-editing sites detected in human genome(hg19). Moreover, multiple studies suggested that RNA-editing events occur in highly specific conditions. According to DARNED, 97.62 % of registered editing sites were detected in a single tissue or in a specific condition, which also supports that the RNA-editing events occur condition-specifically. Since RNA-seq can capture the whole landscape of transcriptome, RNA-seq is widely used for RDD prediction. However, significant amounts of false positives or artefacts can be generated when detecting RNA-editing from RNA-seq. Since it is difficult to perform experimental validation at the whole-transcriptome scale, there should be a powerful computational tool to distinguish true RNA-editing events from artefacts. RESULT: We developed RDDpred, a Random Forest RDD classifier. RDDpred reports potentially true RNA-editing events from RNA-seq data. RDDpred was tested with two publicly available RNA-editing datasets and successfully reproduced RDDs reported in the two studies (90 %, 95 %) while rejecting false-discoveries (NPV: 75 %, 84 %). CONCLUSION: RDDpred automatically compiles condition-specific training examples without experimental validations and then construct a RDD classifier. As far as we know, RDDpred is the very first machine-learning based automated pipeline for RDD prediction. We believe that RDDpred will be very useful and can contribute significantly to the study of condition-specific RNA-editing. RDDpred is available at http://biohealth.snu.ac.kr/software/RDDpred. |
format | Online Article Text |
id | pubmed-4895604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48956042016-06-10 RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data Kim, Min-su Hur, Benjamin Kim, Sun BMC Genomics Proceedings BACKGROUND: RNA-editing is an important post-transcriptional RNA sequence modification performed by two catalytic enzymes, "ADAR"(A-to-I) and "APOBEC"(C-to-U). By utilizing high-throughput sequencing technologies, the biological function of RNA-editing has been actively investigated. Currently, RNA-editing is considered to be a key regulator that controls various cellular functions, such as protein activity, alternative splicing pattern of mRNA, and substitution of miRNA targeting site. DARNED, a public RDD database, reported that there are more than 300-thousands RNA-editing sites detected in human genome(hg19). Moreover, multiple studies suggested that RNA-editing events occur in highly specific conditions. According to DARNED, 97.62 % of registered editing sites were detected in a single tissue or in a specific condition, which also supports that the RNA-editing events occur condition-specifically. Since RNA-seq can capture the whole landscape of transcriptome, RNA-seq is widely used for RDD prediction. However, significant amounts of false positives or artefacts can be generated when detecting RNA-editing from RNA-seq. Since it is difficult to perform experimental validation at the whole-transcriptome scale, there should be a powerful computational tool to distinguish true RNA-editing events from artefacts. RESULT: We developed RDDpred, a Random Forest RDD classifier. RDDpred reports potentially true RNA-editing events from RNA-seq data. RDDpred was tested with two publicly available RNA-editing datasets and successfully reproduced RDDs reported in the two studies (90 %, 95 %) while rejecting false-discoveries (NPV: 75 %, 84 %). CONCLUSION: RDDpred automatically compiles condition-specific training examples without experimental validations and then construct a RDD classifier. As far as we know, RDDpred is the very first machine-learning based automated pipeline for RDD prediction. We believe that RDDpred will be very useful and can contribute significantly to the study of condition-specific RNA-editing. RDDpred is available at http://biohealth.snu.ac.kr/software/RDDpred. BioMed Central 2016-01-11 /pmc/articles/PMC4895604/ /pubmed/26817607 http://dx.doi.org/10.1186/s12864-015-2301-y Text en © Kim et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Kim, Min-su Hur, Benjamin Kim, Sun RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data |
title | RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data |
title_full | RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data |
title_fullStr | RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data |
title_full_unstemmed | RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data |
title_short | RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data |
title_sort | rddpred: a condition-specific rna-editing prediction model from rna-seq data |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895604/ https://www.ncbi.nlm.nih.gov/pubmed/26817607 http://dx.doi.org/10.1186/s12864-015-2301-y |
work_keys_str_mv | AT kimminsu rddpredaconditionspecificrnaeditingpredictionmodelfromrnaseqdata AT hurbenjamin rddpredaconditionspecificrnaeditingpredictionmodelfromrnaseqdata AT kimsun rddpredaconditionspecificrnaeditingpredictionmodelfromrnaseqdata |