Cargando…

Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates

BACKGROUND: In higher eukaryotes, gene expression is regulated at different levels. In particular, 3(′)UTRs play a central role in translation, stability and subcellular localization of transcripts. In recent years, the development of high throughput sequencing techniques has facilitated the acquisi...

Descripción completa

Detalles Bibliográficos
Autores principales: Morgan, Marcos, Iaconcig, Alessandra, Muro, Andrés Fernando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564943/
https://www.ncbi.nlm.nih.gov/pubmed/23244605
http://dx.doi.org/10.1186/1471-2164-13-708
_version_ 1782258389808578560
author Morgan, Marcos
Iaconcig, Alessandra
Muro, Andrés Fernando
author_facet Morgan, Marcos
Iaconcig, Alessandra
Muro, Andrés Fernando
author_sort Morgan, Marcos
collection PubMed
description BACKGROUND: In higher eukaryotes, gene expression is regulated at different levels. In particular, 3(′)UTRs play a central role in translation, stability and subcellular localization of transcripts. In recent years, the development of high throughput sequencing techniques has facilitated the acquisition of transcriptional data at a genome wide level. However, annotation of the 3(′) ends of genes is still incomplete, thus limiting the interpretation of the data generated. For example, we have previously reported two different genes, ADD2 and CPEB3, with conserved 3(′)UTR alternative isoforms not annotated in the current versions of Ensembl and RefSeq human databases. RESULTS: In order to evaluate the existence of other conserved 3(′) ends not annotated in these databases we have now used comparative genomics and transcriptomics across several vertebrate species. In general, we have observed that 3(′)UTR conservation is lost after the end of the mature transcript. Using this change in conservation before and after the 3(′) end of the mature transcripts we have shown that many conserved ends were still not annotated. In addition, we used orthologous transcripts to predict 3(′)UTR extensions and validated these predictions using total RNA sequencing data. Finally, we used this method to identify not annotated 3(′) ends in rats and dogs. As a result, we report several hundred novel 3(′)UTR extensions in rats and a few thousand in dogs. CONCLUSIONS: The methods presented here can efficiently facilitate the identification of not-yet-annotated conserved 3(′)UTR extensions. The application of these methods will increase the confidence of orthologous gene models across vertebrates.
format Online
Article
Text
id pubmed-3564943
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35649432013-02-08 Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates Morgan, Marcos Iaconcig, Alessandra Muro, Andrés Fernando BMC Genomics Methodology Article BACKGROUND: In higher eukaryotes, gene expression is regulated at different levels. In particular, 3(′)UTRs play a central role in translation, stability and subcellular localization of transcripts. In recent years, the development of high throughput sequencing techniques has facilitated the acquisition of transcriptional data at a genome wide level. However, annotation of the 3(′) ends of genes is still incomplete, thus limiting the interpretation of the data generated. For example, we have previously reported two different genes, ADD2 and CPEB3, with conserved 3(′)UTR alternative isoforms not annotated in the current versions of Ensembl and RefSeq human databases. RESULTS: In order to evaluate the existence of other conserved 3(′) ends not annotated in these databases we have now used comparative genomics and transcriptomics across several vertebrate species. In general, we have observed that 3(′)UTR conservation is lost after the end of the mature transcript. Using this change in conservation before and after the 3(′) end of the mature transcripts we have shown that many conserved ends were still not annotated. In addition, we used orthologous transcripts to predict 3(′)UTR extensions and validated these predictions using total RNA sequencing data. Finally, we used this method to identify not annotated 3(′) ends in rats and dogs. As a result, we report several hundred novel 3(′)UTR extensions in rats and a few thousand in dogs. CONCLUSIONS: The methods presented here can efficiently facilitate the identification of not-yet-annotated conserved 3(′)UTR extensions. The application of these methods will increase the confidence of orthologous gene models across vertebrates. BioMed Central 2012-12-18 /pmc/articles/PMC3564943/ /pubmed/23244605 http://dx.doi.org/10.1186/1471-2164-13-708 Text en Copyright ©2012 Morgan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Morgan, Marcos
Iaconcig, Alessandra
Muro, Andrés Fernando
Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
title Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
title_full Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
title_fullStr Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
title_full_unstemmed Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
title_short Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
title_sort identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564943/
https://www.ncbi.nlm.nih.gov/pubmed/23244605
http://dx.doi.org/10.1186/1471-2164-13-708
work_keys_str_mv AT morganmarcos identificationof3geneendsusingtranscriptionalandgenomicconservationacrossvertebrates
AT iaconcigalessandra identificationof3geneendsusingtranscriptionalandgenomicconservationacrossvertebrates
AT muroandresfernando identificationof3geneendsusingtranscriptionalandgenomicconservationacrossvertebrates