Cargando…

Global transcript structure resolution of high gene density genomes through multi-platform data integration

Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus trans...

Descripción completa

Detalles Bibliográficos
Autores principales: O'Grady, Tina, Wang, Xia, Höner zu Bentrup, Kerstin, Baddoo, Melody, Concha, Monica, Flemington, Erik K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5062983/
https://www.ncbi.nlm.nih.gov/pubmed/27407110
http://dx.doi.org/10.1093/nar/gkw629
_version_ 1782459889334878208
author O'Grady, Tina
Wang, Xia
Höner zu Bentrup, Kerstin
Baddoo, Melody
Concha, Monica
Flemington, Erik K.
author_facet O'Grady, Tina
Wang, Xia
Höner zu Bentrup, Kerstin
Baddoo, Melody
Concha, Monica
Flemington, Erik K.
author_sort O'Grady, Tina
collection PubMed
description Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome.
format Online
Article
Text
id pubmed-5062983
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50629832016-10-14 Global transcript structure resolution of high gene density genomes through multi-platform data integration O'Grady, Tina Wang, Xia Höner zu Bentrup, Kerstin Baddoo, Melody Concha, Monica Flemington, Erik K. Nucleic Acids Res Methods Online Annotation of herpesvirus genomes has traditionally been undertaken through the detection of open reading frames and other genomic motifs, supplemented with sequencing of individual cDNAs. Second generation sequencing and high-density microarray studies have revealed vastly greater herpesvirus transcriptome complexity than is captured by existing annotation. The pervasive nature of overlapping transcription throughout herpesvirus genomes, however, poses substantial problems in resolving transcript structures using these methods alone. We present an approach that combines the unique attributes of Pacific Biosciences Iso-Seq long-read, Illumina short-read and deepCAGE (Cap Analysis of Gene Expression) sequencing to globally resolve polyadenylated isoform structures in replicating Epstein-Barr virus (EBV). Our method, Transcriptome Resolution through Integration of Multi-platform Data (TRIMD), identifies nearly 300 novel EBV transcripts, quadrupling the size of the annotated viral transcriptome. These findings illustrate an array of mechanisms through which EBV achieves functional diversity in its relatively small, compact genome including programmed alternative splicing (e.g. across the IR1 repeats), alternative promoter usage by LMP2 and other latency-associated transcripts, intergenic splicing at the BZLF2 locus, and antisense transcription and pervasive readthrough transcription throughout the genome. Oxford University Press 2016-10-14 2016-07-12 /pmc/articles/PMC5062983/ /pubmed/27407110 http://dx.doi.org/10.1093/nar/gkw629 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
O'Grady, Tina
Wang, Xia
Höner zu Bentrup, Kerstin
Baddoo, Melody
Concha, Monica
Flemington, Erik K.
Global transcript structure resolution of high gene density genomes through multi-platform data integration
title Global transcript structure resolution of high gene density genomes through multi-platform data integration
title_full Global transcript structure resolution of high gene density genomes through multi-platform data integration
title_fullStr Global transcript structure resolution of high gene density genomes through multi-platform data integration
title_full_unstemmed Global transcript structure resolution of high gene density genomes through multi-platform data integration
title_short Global transcript structure resolution of high gene density genomes through multi-platform data integration
title_sort global transcript structure resolution of high gene density genomes through multi-platform data integration
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5062983/
https://www.ncbi.nlm.nih.gov/pubmed/27407110
http://dx.doi.org/10.1093/nar/gkw629
work_keys_str_mv AT ogradytina globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration
AT wangxia globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration
AT honerzubentrupkerstin globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration
AT baddoomelody globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration
AT conchamonica globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration
AT flemingtonerikk globaltranscriptstructureresolutionofhighgenedensitygenomesthroughmultiplatformdataintegration