Cargando…

Pseudo–Messenger RNA: Phantoms of the Transcriptome

The mammalian transcriptome harbours shadowy entities that resist classification and analysis. In analogy with pseudogenes, we define pseudo–messenger RNA to be RNA molecules that resemble protein-coding mRNA, but cannot encode full-length proteins owing to disruptions of the reading frame. Using a...

Descripción completa

Detalles Bibliográficos
Autores principales: Frith, Martin C, Wilming, Laurens G, Forrest, Alistair, Kawaji, Hideya, Tan, Sin Lam, Wahlestedt, Claes, Bajic, Vladimir B, Kai, Chikatoshi, Kawai, Jun, Carninci, Piero, Hayashizaki, Yoshihide, Bailey, Timothy L, Huminiecki, Lukasz
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1449882/
https://www.ncbi.nlm.nih.gov/pubmed/16683022
http://dx.doi.org/10.1371/journal.pgen.0020023
_version_ 1782127372816875520
author Frith, Martin C
Wilming, Laurens G
Forrest, Alistair
Kawaji, Hideya
Tan, Sin Lam
Wahlestedt, Claes
Bajic, Vladimir B
Kai, Chikatoshi
Kawai, Jun
Carninci, Piero
Hayashizaki, Yoshihide
Bailey, Timothy L
Huminiecki, Lukasz
author_facet Frith, Martin C
Wilming, Laurens G
Forrest, Alistair
Kawaji, Hideya
Tan, Sin Lam
Wahlestedt, Claes
Bajic, Vladimir B
Kai, Chikatoshi
Kawai, Jun
Carninci, Piero
Hayashizaki, Yoshihide
Bailey, Timothy L
Huminiecki, Lukasz
author_sort Frith, Martin C
collection PubMed
description The mammalian transcriptome harbours shadowy entities that resist classification and analysis. In analogy with pseudogenes, we define pseudo–messenger RNA to be RNA molecules that resemble protein-coding mRNA, but cannot encode full-length proteins owing to disruptions of the reading frame. Using a rigorous computational pipeline, which rules out sequencing errors, we identify 10,679 pseudo–messenger RNAs (approximately half of which are transposon-associated) among the 102,801 FANTOM3 mouse cDNAs: just over 10% of the FANTOM3 transcriptome. These comprise not only transcribed pseudogenes, but also disrupted splice variants of otherwise protein-coding genes. Some may encode truncated proteins, only a minority of which appear subject to nonsense-mediated decay. The presence of an excess of transcripts whose only disruptions are opal stop codons suggests that there are more selenoproteins than currently estimated. We also describe compensatory frameshifts, where a segment of the gene has changed frame but remains translatable. In summary, we survey a large class of non-standard but potentially functional transcripts that are likely to encode genetic information and effect biological processes in novel ways. Many of these transcripts do not correspond cleanly to any identifiable object in the genome, implying fundamental limits to the goal of annotating all functional elements at the genome sequence level.
format Text
id pubmed-1449882
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-14498822006-05-08 Pseudo–Messenger RNA: Phantoms of the Transcriptome Frith, Martin C Wilming, Laurens G Forrest, Alistair Kawaji, Hideya Tan, Sin Lam Wahlestedt, Claes Bajic, Vladimir B Kai, Chikatoshi Kawai, Jun Carninci, Piero Hayashizaki, Yoshihide Bailey, Timothy L Huminiecki, Lukasz PLoS Genet Research Article The mammalian transcriptome harbours shadowy entities that resist classification and analysis. In analogy with pseudogenes, we define pseudo–messenger RNA to be RNA molecules that resemble protein-coding mRNA, but cannot encode full-length proteins owing to disruptions of the reading frame. Using a rigorous computational pipeline, which rules out sequencing errors, we identify 10,679 pseudo–messenger RNAs (approximately half of which are transposon-associated) among the 102,801 FANTOM3 mouse cDNAs: just over 10% of the FANTOM3 transcriptome. These comprise not only transcribed pseudogenes, but also disrupted splice variants of otherwise protein-coding genes. Some may encode truncated proteins, only a minority of which appear subject to nonsense-mediated decay. The presence of an excess of transcripts whose only disruptions are opal stop codons suggests that there are more selenoproteins than currently estimated. We also describe compensatory frameshifts, where a segment of the gene has changed frame but remains translatable. In summary, we survey a large class of non-standard but potentially functional transcripts that are likely to encode genetic information and effect biological processes in novel ways. Many of these transcripts do not correspond cleanly to any identifiable object in the genome, implying fundamental limits to the goal of annotating all functional elements at the genome sequence level. Public Library of Science 2006-04 2006-04-28 /pmc/articles/PMC1449882/ /pubmed/16683022 http://dx.doi.org/10.1371/journal.pgen.0020023 Text en © 2006 Frith et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Frith, Martin C
Wilming, Laurens G
Forrest, Alistair
Kawaji, Hideya
Tan, Sin Lam
Wahlestedt, Claes
Bajic, Vladimir B
Kai, Chikatoshi
Kawai, Jun
Carninci, Piero
Hayashizaki, Yoshihide
Bailey, Timothy L
Huminiecki, Lukasz
Pseudo–Messenger RNA: Phantoms of the Transcriptome
title Pseudo–Messenger RNA: Phantoms of the Transcriptome
title_full Pseudo–Messenger RNA: Phantoms of the Transcriptome
title_fullStr Pseudo–Messenger RNA: Phantoms of the Transcriptome
title_full_unstemmed Pseudo–Messenger RNA: Phantoms of the Transcriptome
title_short Pseudo–Messenger RNA: Phantoms of the Transcriptome
title_sort pseudo–messenger rna: phantoms of the transcriptome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1449882/
https://www.ncbi.nlm.nih.gov/pubmed/16683022
http://dx.doi.org/10.1371/journal.pgen.0020023
work_keys_str_mv AT frithmartinc pseudomessengerrnaphantomsofthetranscriptome
AT wilminglaurensg pseudomessengerrnaphantomsofthetranscriptome
AT forrestalistair pseudomessengerrnaphantomsofthetranscriptome
AT kawajihideya pseudomessengerrnaphantomsofthetranscriptome
AT tansinlam pseudomessengerrnaphantomsofthetranscriptome
AT wahlestedtclaes pseudomessengerrnaphantomsofthetranscriptome
AT bajicvladimirb pseudomessengerrnaphantomsofthetranscriptome
AT kaichikatoshi pseudomessengerrnaphantomsofthetranscriptome
AT kawaijun pseudomessengerrnaphantomsofthetranscriptome
AT carnincipiero pseudomessengerrnaphantomsofthetranscriptome
AT hayashizakiyoshihide pseudomessengerrnaphantomsofthetranscriptome
AT baileytimothyl pseudomessengerrnaphantomsofthetranscriptome
AT huminieckilukasz pseudomessengerrnaphantomsofthetranscriptome