Cargando…

Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs

The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence rea...

Descripción completa

Detalles Bibliográficos
Autores principales: Schurch, Nicholas J., Cole, Christian, Sherstnev, Alexander, Song, Junfang, Duc, Céline, Storey, Kate G., McLean, W. H. Irwin, Brown, Sara J., Simpson, Gordon G., Barton, Geoffrey J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3983147/
https://www.ncbi.nlm.nih.gov/pubmed/24722185
http://dx.doi.org/10.1371/journal.pone.0094270
_version_ 1782311271578730496
author Schurch, Nicholas J.
Cole, Christian
Sherstnev, Alexander
Song, Junfang
Duc, Céline
Storey, Kate G.
McLean, W. H. Irwin
Brown, Sara J.
Simpson, Gordon G.
Barton, Geoffrey J.
author_facet Schurch, Nicholas J.
Cole, Christian
Sherstnev, Alexander
Song, Junfang
Duc, Céline
Storey, Kate G.
McLean, W. H. Irwin
Brown, Sara J.
Simpson, Gordon G.
Barton, Geoffrey J.
author_sort Schurch, Nicholas J.
collection PubMed
description The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3′ untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3′ polyadenylation sites to within +/− 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3′ UTR re-annotation (including extension of one 3′ UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data.
format Online
Article
Text
id pubmed-3983147
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39831472014-04-15 Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs Schurch, Nicholas J. Cole, Christian Sherstnev, Alexander Song, Junfang Duc, Céline Storey, Kate G. McLean, W. H. Irwin Brown, Sara J. Simpson, Gordon G. Barton, Geoffrey J. PLoS One Research Article The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3′ untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3′ polyadenylation sites to within +/− 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3′ UTR re-annotation (including extension of one 3′ UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data. Public Library of Science 2014-04-10 /pmc/articles/PMC3983147/ /pubmed/24722185 http://dx.doi.org/10.1371/journal.pone.0094270 Text en © 2014 Schurch et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Schurch, Nicholas J.
Cole, Christian
Sherstnev, Alexander
Song, Junfang
Duc, Céline
Storey, Kate G.
McLean, W. H. Irwin
Brown, Sara J.
Simpson, Gordon G.
Barton, Geoffrey J.
Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs
title Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs
title_full Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs
title_fullStr Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs
title_full_unstemmed Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs
title_short Improved Annotation of 3′ Untranslated Regions and Complex Loci by Combination of Strand-Specific Direct RNA Sequencing, RNA-Seq and ESTs
title_sort improved annotation of 3′ untranslated regions and complex loci by combination of strand-specific direct rna sequencing, rna-seq and ests
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3983147/
https://www.ncbi.nlm.nih.gov/pubmed/24722185
http://dx.doi.org/10.1371/journal.pone.0094270
work_keys_str_mv AT schurchnicholasj improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT colechristian improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT sherstnevalexander improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT songjunfang improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT ducceline improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT storeykateg improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT mcleanwhirwin improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT brownsaraj improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT simpsongordong improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests
AT bartongeoffreyj improvedannotationof3untranslatedregionsandcomplexlocibycombinationofstrandspecificdirectrnasequencingrnaseqandests