Cargando…
Global transcript structure resolution of high gene density genomes through multi-platform data integration
Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus trans...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5062983/ https://www.ncbi.nlm.nih.gov/pubmed/27407110 http://dx.doi.org/10.1093/nar/gkw629 |
_version_ | 1782459889334878208 |
---|---|
author | O'Grady, Tina Wang, Xia Höner zu Bentrup, Kerstin Baddoo, Melody Concha, Monica Flemington, Erik K. |
author_facet | O'Grady, Tina Wang, Xia Höner zu Bentrup, Kerstin Baddoo, Melody Concha, Monica Flemington, Erik K. |
author_sort | O'Grady, Tina |
collection | PubMed |
description | Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome. |
format | Online Article Text |
id | pubmed-5062983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-50629832016-10-14 Global transcript structure resolution of high gene density genomes through multi-platform data integration O'Grady, Tina Wang, Xia Höner zu Bentrup, Kerstin Baddoo, Melody Concha, Monica Flemington, Erik K. Nucleic Acids Res Methods Online Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome. Oxford University Press 2016-10-14 2016-07-12 /pmc/articles/PMC5062983/ /pubmed/27407110 http://dx.doi.org/10.1093/nar/gkw629 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online O'Grady, Tina Wang, Xia Höner zu Bentrup, Kerstin Baddoo, Melody Concha, Monica Flemington, Erik K. Global transcript structure resolution of high gene density genomes through multi-platform data integration |
title | Global transcript structure resolution of high gene density genomes through multi-platform data integration |
title_full | Global transcript structure resolution of high gene density genomes through multi-platform data integration |
title_fullStr | Global transcript structure resolution of high gene density genomes through multi-platform data integration |
title_full_unstemmed | Global transcript structure resolution of high gene density genomes through multi-platform data integration |
title_short | Global transcript structure resolution of high gene density genomes through multi-platform data integration |
title_sort | global transcript structure resolution of high gene density genomes through multi-platform data integration |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5062983/ https://www.ncbi.nlm.nih.gov/pubmed/27407110 http://dx.doi.org/10.1093/nar/gkw629 |
work_keys_str_mv | AT ogradytina globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration AT wangxia globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration AT honerzubentrupkerstin globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration AT baddoomelody globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration AT conchamonica globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration AT flemingtonerikk globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration |