Cargando…

Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes

Recent studies of mammalian transcriptomes have identified numerous RNA transcripts that do not code for proteins; their identity, however, is largely unknown. Here we explore an approach based on sequence randomness patterns to discern different RNA classes. The relative z-score we use helps identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Xin, Yurong, Quarta, Giulio, Gan, Hin Hark, Schlick, Tamar
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2735967/
https://www.ncbi.nlm.nih.gov/pubmed/19812767
_version_ 1782171289245450240
author Xin, Yurong
Quarta, Giulio
Gan, Hin Hark
Schlick, Tamar
author_facet Xin, Yurong
Quarta, Giulio
Gan, Hin Hark
Schlick, Tamar
author_sort Xin, Yurong
collection PubMed
description Recent studies of mammalian transcriptomes have identified numerous RNA transcripts that do not code for proteins; their identity, however, is largely unknown. Here we explore an approach based on sequence randomness patterns to discern different RNA classes. The relative z-score we use helps identify the known ncRNA class from the genome, intergene and intron classes. This leads us to a fractional ncRNA measure of putative ncRNA datasets which we model as a mixture of genuine ncRNAs and other transcripts derived from genomic, intergenic and intronic sequences. We use this model to analyze six representative datasets identified by the FANTOM3 project and two computational approaches based on comparative analysis (RNAz and EvoFold). Our analysis suggests fewer ncRNAs than estimated by DNA sequencing and comparative analysis, but the verity of our approach and its prediction requires more extensive experimental RNA data.
format Text
id pubmed-2735967
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-27359672009-09-14 Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes Xin, Yurong Quarta, Giulio Gan, Hin Hark Schlick, Tamar Bioinform Biol Insights Original Research Recent studies of mammalian transcriptomes have identified numerous RNA transcripts that do not code for proteins; their identity, however, is largely unknown. Here we explore an approach based on sequence randomness patterns to discern different RNA classes. The relative z-score we use helps identify the known ncRNA class from the genome, intergene and intron classes. This leads us to a fractional ncRNA measure of putative ncRNA datasets which we model as a mixture of genuine ncRNAs and other transcripts derived from genomic, intergenic and intronic sequences. We use this model to analyze six representative datasets identified by the FANTOM3 project and two computational approaches based on comparative analysis (RNAz and EvoFold). Our analysis suggests fewer ncRNAs than estimated by DNA sequencing and comparative analysis, but the verity of our approach and its prediction requires more extensive experimental RNA data. Libertas Academica 2008-03-01 /pmc/articles/PMC2735967/ /pubmed/19812767 Text en Copyright © 2008 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Xin, Yurong
Quarta, Giulio
Gan, Hin Hark
Schlick, Tamar
Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes
title Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes
title_full Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes
title_fullStr Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes
title_full_unstemmed Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes
title_short Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes
title_sort estimating the fraction of non-coding rnas in mammalian transcriptomes
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2735967/
https://www.ncbi.nlm.nih.gov/pubmed/19812767
work_keys_str_mv AT xinyurong estimatingthefractionofnoncodingrnasinmammaliantranscriptomes
AT quartagiulio estimatingthefractionofnoncodingrnasinmammaliantranscriptomes
AT ganhinhark estimatingthefractionofnoncodingrnasinmammaliantranscriptomes
AT schlicktamar estimatingthefractionofnoncodingrnasinmammaliantranscriptomes