Cargando…
Improved definition of the mouse transcriptome via targeted RNA sequencing
Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly....
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4864457/ https://www.ncbi.nlm.nih.gov/pubmed/27197243 http://dx.doi.org/10.1101/gr.199760.115 |
_version_ | 1782431630575534080 |
---|---|
author | Bussotti, Giovanni Leonardi, Tommaso Clark, Michael B. Mercer, Tim R. Crawford, Joanna Malquori, Lorenzo Notredame, Cedric Dinger, Marcel E. Mattick, John S. Enright, Anton J. |
author_facet | Bussotti, Giovanni Leonardi, Tommaso Clark, Michael B. Mercer, Tim R. Crawford, Joanna Malquori, Lorenzo Notredame, Cedric Dinger, Marcel E. Mattick, John S. Enright, Anton J. |
author_sort | Bussotti, Giovanni |
collection | PubMed |
description | Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources. |
format | Online Article Text |
id | pubmed-4864457 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48644572016-05-24 Improved definition of the mouse transcriptome via targeted RNA sequencing Bussotti, Giovanni Leonardi, Tommaso Clark, Michael B. Mercer, Tim R. Crawford, Joanna Malquori, Lorenzo Notredame, Cedric Dinger, Marcel E. Mattick, John S. Enright, Anton J. Genome Res Resource Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources. Cold Spring Harbor Laboratory Press 2016-05 /pmc/articles/PMC4864457/ /pubmed/27197243 http://dx.doi.org/10.1101/gr.199760.115 Text en © 2016 Bussotti et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Resource Bussotti, Giovanni Leonardi, Tommaso Clark, Michael B. Mercer, Tim R. Crawford, Joanna Malquori, Lorenzo Notredame, Cedric Dinger, Marcel E. Mattick, John S. Enright, Anton J. Improved definition of the mouse transcriptome via targeted RNA sequencing |
title | Improved definition of the mouse transcriptome via targeted RNA sequencing |
title_full | Improved definition of the mouse transcriptome via targeted RNA sequencing |
title_fullStr | Improved definition of the mouse transcriptome via targeted RNA sequencing |
title_full_unstemmed | Improved definition of the mouse transcriptome via targeted RNA sequencing |
title_short | Improved definition of the mouse transcriptome via targeted RNA sequencing |
title_sort | improved definition of the mouse transcriptome via targeted rna sequencing |
topic | Resource |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4864457/ https://www.ncbi.nlm.nih.gov/pubmed/27197243 http://dx.doi.org/10.1101/gr.199760.115 |
work_keys_str_mv | AT bussottigiovanni improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT leonarditommaso improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT clarkmichaelb improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT mercertimr improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT crawfordjoanna improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT malquorilorenzo improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT notredamecedric improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT dingermarcele improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT mattickjohns improveddefinitionofthemousetranscriptomeviatargetedrnasequencing AT enrightantonj improveddefinitionofthemousetranscriptomeviatargetedrnasequencing |