Cargando…

Improved definition of the mouse transcriptome via targeted RNA sequencing

Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly....

Descripción completa

Detalles Bibliográficos
Autores principales: Bussotti, Giovanni, Leonardi, Tommaso, Clark, Michael B., Mercer, Tim R., Crawford, Joanna, Malquori, Lorenzo, Notredame, Cedric, Dinger, Marcel E., Mattick, John S., Enright, Anton J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4864457/
https://www.ncbi.nlm.nih.gov/pubmed/27197243
http://dx.doi.org/10.1101/gr.199760.115
_version_ 1782431630575534080
author Bussotti, Giovanni
Leonardi, Tommaso
Clark, Michael B.
Mercer, Tim R.
Crawford, Joanna
Malquori, Lorenzo
Notredame, Cedric
Dinger, Marcel E.
Mattick, John S.
Enright, Anton J.
author_facet Bussotti, Giovanni
Leonardi, Tommaso
Clark, Michael B.
Mercer, Tim R.
Crawford, Joanna
Malquori, Lorenzo
Notredame, Cedric
Dinger, Marcel E.
Mattick, John S.
Enright, Anton J.
author_sort Bussotti, Giovanni
collection PubMed
description Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources.
format Online
Article
Text
id pubmed-4864457
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-48644572016-05-24 Improved definition of the mouse transcriptome via targeted RNA sequencing Bussotti, Giovanni Leonardi, Tommaso Clark, Michael B. Mercer, Tim R. Crawford, Joanna Malquori, Lorenzo Notredame, Cedric Dinger, Marcel E. Mattick, John S. Enright, Anton J. Genome Res Resource Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources. Cold Spring Harbor Laboratory Press 2016-05 /pmc/articles/PMC4864457/ /pubmed/27197243 http://dx.doi.org/10.1101/gr.199760.115 Text en © 2016 Bussotti et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
spellingShingle Resource
Bussotti, Giovanni
Leonardi, Tommaso
Clark, Michael B.
Mercer, Tim R.
Crawford, Joanna
Malquori, Lorenzo
Notredame, Cedric
Dinger, Marcel E.
Mattick, John S.
Enright, Anton J.
Improved definition of the mouse transcriptome via targeted RNA sequencing
title Improved definition of the mouse transcriptome via targeted RNA sequencing
title_full Improved definition of the mouse transcriptome via targeted RNA sequencing
title_fullStr Improved definition of the mouse transcriptome via targeted RNA sequencing
title_full_unstemmed Improved definition of the mouse transcriptome via targeted RNA sequencing
title_short Improved definition of the mouse transcriptome via targeted RNA sequencing
title_sort improved definition of the mouse transcriptome via targeted rna sequencing
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4864457/
https://www.ncbi.nlm.nih.gov/pubmed/27197243
http://dx.doi.org/10.1101/gr.199760.115
work_keys_str_mv AT bussottigiovanni improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT leonarditommaso improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT clarkmichaelb improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT mercertimr improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT crawfordjoanna improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT malquorilorenzo improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT notredamecedric improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT dingermarcele improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT mattickjohns improveddefinitionofthemousetranscriptomeviatargetedrnasequencing
AT enrightantonj improveddefinitionofthemousetranscriptomeviatargetedrnasequencing