Cargando…

Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs

Known protein coding gene exons compose less than 3% of the human genome. The remaining 97% is largely uncharted territory, with only a small fraction characterized. The recent observation of transcription in this intergenic territory has stimulated debate about the extent of intergenic transcriptio...

Descripción completa

Detalles Bibliográficos
Autores principales: Hangauer, Matthew J., Vaughn, Ian W., McManus, Michael T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3688513/
https://www.ncbi.nlm.nih.gov/pubmed/23818866
http://dx.doi.org/10.1371/journal.pgen.1003569
_version_ 1782476216105697280
author Hangauer, Matthew J.
Vaughn, Ian W.
McManus, Michael T.
author_facet Hangauer, Matthew J.
Vaughn, Ian W.
McManus, Michael T.
author_sort Hangauer, Matthew J.
collection PubMed
description Known protein coding gene exons compose less than 3% of the human genome. The remaining 97% is largely uncharted territory, with only a small fraction characterized. The recent observation of transcription in this intergenic territory has stimulated debate about the extent of intergenic transcription and whether these intergenic RNAs are functional. Here we directly observed with a large set of RNA-seq data covering a wide array of human tissue types that the majority of the genome is indeed transcribed, corroborating recent observations by the ENCODE project. Furthermore, using de novo transcriptome assembly of this RNA-seq data, we found that intergenic regions encode far more long intergenic noncoding RNAs (lincRNAs) than previously described, helping to resolve the discrepancy between the vast amount of observed intergenic transcription and the limited number of previously known lincRNAs. In total, we identified tens of thousands of putative lincRNAs expressed at a minimum of one copy per cell, significantly expanding upon prior lincRNA annotation sets. These lincRNAs are specifically regulated and conserved rather than being the product of transcriptional noise. In addition, lincRNAs are strongly enriched for trait-associated SNPs suggesting a new mechanism by which intergenic trait-associated regions may function. These findings will enable the discovery and interrogation of novel intergenic functional elements.
format Online
Article
Text
id pubmed-3688513
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36885132013-07-01 Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs Hangauer, Matthew J. Vaughn, Ian W. McManus, Michael T. PLoS Genet Research Article Known protein coding gene exons compose less than 3% of the human genome. The remaining 97% is largely uncharted territory, with only a small fraction characterized. The recent observation of transcription in this intergenic territory has stimulated debate about the extent of intergenic transcription and whether these intergenic RNAs are functional. Here we directly observed with a large set of RNA-seq data covering a wide array of human tissue types that the majority of the genome is indeed transcribed, corroborating recent observations by the ENCODE project. Furthermore, using de novo transcriptome assembly of this RNA-seq data, we found that intergenic regions encode far more long intergenic noncoding RNAs (lincRNAs) than previously described, helping to resolve the discrepancy between the vast amount of observed intergenic transcription and the limited number of previously known lincRNAs. In total, we identified tens of thousands of putative lincRNAs expressed at a minimum of one copy per cell, significantly expanding upon prior lincRNA annotation sets. These lincRNAs are specifically regulated and conserved rather than being the product of transcriptional noise. In addition, lincRNAs are strongly enriched for trait-associated SNPs suggesting a new mechanism by which intergenic trait-associated regions may function. These findings will enable the discovery and interrogation of novel intergenic functional elements. Public Library of Science 2013-06-20 /pmc/articles/PMC3688513/ /pubmed/23818866 http://dx.doi.org/10.1371/journal.pgen.1003569 Text en © 2013 Hangauer et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hangauer, Matthew J.
Vaughn, Ian W.
McManus, Michael T.
Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs
title Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs
title_full Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs
title_fullStr Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs
title_full_unstemmed Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs
title_short Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs
title_sort pervasive transcription of the human genome produces thousands of previously unidentified long intergenic noncoding rnas
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3688513/
https://www.ncbi.nlm.nih.gov/pubmed/23818866
http://dx.doi.org/10.1371/journal.pgen.1003569
work_keys_str_mv AT hangauermatthewj pervasivetranscriptionofthehumangenomeproducesthousandsofpreviouslyunidentifiedlongintergenicnoncodingrnas
AT vaughnianw pervasivetranscriptionofthehumangenomeproducesthousandsofpreviouslyunidentifiedlongintergenicnoncodingrnas
AT mcmanusmichaelt pervasivetranscriptionofthehumangenomeproducesthousandsofpreviouslyunidentifiedlongintergenicnoncodingrnas