Cargando…

Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis

A comprehensive transcriptome analysis has been performed on protein-coding RNAs of Strongylocentrotus purpuratus, including 10 different embryonic stages, six feeding larval and metamorphosed juvenile stages, and six adult tissues. In this study, we pooled the transcriptomes from all of these sourc...

Descripción completa

Detalles Bibliográficos
Autores principales: Tu, Qiang, Cameron, R. Andrew, Worley, Kim C., Gibbs, Richard A., Davidson, Eric H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3460201/
https://www.ncbi.nlm.nih.gov/pubmed/22709795
http://dx.doi.org/10.1101/gr.139170.112
_version_ 1782244924624732160
author Tu, Qiang
Cameron, R. Andrew
Worley, Kim C.
Gibbs, Richard A.
Davidson, Eric H.
author_facet Tu, Qiang
Cameron, R. Andrew
Worley, Kim C.
Gibbs, Richard A.
Davidson, Eric H.
author_sort Tu, Qiang
collection PubMed
description A comprehensive transcriptome analysis has been performed on protein-coding RNAs of Strongylocentrotus purpuratus, including 10 different embryonic stages, six feeding larval and metamorphosed juvenile stages, and six adult tissues. In this study, we pooled the transcriptomes from all of these sources and focused on the insights they provide for gene structure in the genome of this recently sequenced model system. The genome had initially been annotated by use of computational gene model prediction algorithms. A large fraction of these predicted genes were recovered in the transcriptome when the reads were mapped to the genome and appropriately filtered and analyzed. However, in a manually curated subset, we discovered that more than half the computational gene model predictions were imperfect, containing errors such as missing exons, prediction of nonexistent exons, erroneous intron/exon boundaries, fusion of adjacent genes, and prediction of multiple genes from single genes. The transcriptome data have been used to provide a systematic upgrade of the gene model predictions throughout the genome, very greatly improving the research usability of the genomic sequence. We have constructed new public databases that incorporate information from the transcriptome analyses. The transcript-based gene model data were used to define average structural parameters for S. purpuratus protein-coding genes. In addition, we constructed a custom sea urchin gene ontology, and assigned about 7000 different annotated transcripts to 24 functional classes. Strong correlations became evident between given functional ontology classes and structural properties, including gene size, exon number, and exon and intron size.
format Online
Article
Text
id pubmed-3460201
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-34602012013-04-01 Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis Tu, Qiang Cameron, R. Andrew Worley, Kim C. Gibbs, Richard A. Davidson, Eric H. Genome Res Resource A comprehensive transcriptome analysis has been performed on protein-coding RNAs of Strongylocentrotus purpuratus, including 10 different embryonic stages, six feeding larval and metamorphosed juvenile stages, and six adult tissues. In this study, we pooled the transcriptomes from all of these sources and focused on the insights they provide for gene structure in the genome of this recently sequenced model system. The genome had initially been annotated by use of computational gene model prediction algorithms. A large fraction of these predicted genes were recovered in the transcriptome when the reads were mapped to the genome and appropriately filtered and analyzed. However, in a manually curated subset, we discovered that more than half the computational gene model predictions were imperfect, containing errors such as missing exons, prediction of nonexistent exons, erroneous intron/exon boundaries, fusion of adjacent genes, and prediction of multiple genes from single genes. The transcriptome data have been used to provide a systematic upgrade of the gene model predictions throughout the genome, very greatly improving the research usability of the genomic sequence. We have constructed new public databases that incorporate information from the transcriptome analyses. The transcript-based gene model data were used to define average structural parameters for S. purpuratus protein-coding genes. In addition, we constructed a custom sea urchin gene ontology, and assigned about 7000 different annotated transcripts to 24 functional classes. Strong correlations became evident between given functional ontology classes and structural properties, including gene size, exon number, and exon and intron size. Cold Spring Harbor Laboratory Press 2012-10 /pmc/articles/PMC3460201/ /pubmed/22709795 http://dx.doi.org/10.1101/gr.139170.112 Text en © 2012, Published by Cold Spring Harbor Laboratory Press This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.
spellingShingle Resource
Tu, Qiang
Cameron, R. Andrew
Worley, Kim C.
Gibbs, Richard A.
Davidson, Eric H.
Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis
title Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis
title_full Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis
title_fullStr Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis
title_full_unstemmed Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis
title_short Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis
title_sort gene structure in the sea urchin strongylocentrotus purpuratus based on transcriptome analysis
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3460201/
https://www.ncbi.nlm.nih.gov/pubmed/22709795
http://dx.doi.org/10.1101/gr.139170.112
work_keys_str_mv AT tuqiang genestructureintheseaurchinstrongylocentrotuspurpuratusbasedontranscriptomeanalysis
AT cameronrandrew genestructureintheseaurchinstrongylocentrotuspurpuratusbasedontranscriptomeanalysis
AT worleykimc genestructureintheseaurchinstrongylocentrotuspurpuratusbasedontranscriptomeanalysis
AT gibbsricharda genestructureintheseaurchinstrongylocentrotuspurpuratusbasedontranscriptomeanalysis
AT davidsonerich genestructureintheseaurchinstrongylocentrotuspurpuratusbasedontranscriptomeanalysis