Cargando…

Identification of 3(′) gene ends using transcriptional and genomic conservation across vertebrates

BACKGROUND: In higher eukaryotes, gene expression is regulated at different levels. In particular, 3(′)UTRs play a central role in translation, stability and subcellular localization of transcripts. In recent years, the development of high throughput sequencing techniques has facilitated the acquisi...

Descripción completa

Detalles Bibliográficos
Autores principales: Morgan, Marcos, Iaconcig, Alessandra, Muro, Andrés Fernando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564943/
https://www.ncbi.nlm.nih.gov/pubmed/23244605
http://dx.doi.org/10.1186/1471-2164-13-708
Descripción
Sumario:BACKGROUND: In higher eukaryotes, gene expression is regulated at different levels. In particular, 3(′)UTRs play a central role in translation, stability and subcellular localization of transcripts. In recent years, the development of high throughput sequencing techniques has facilitated the acquisition of transcriptional data at a genome wide level. However, annotation of the 3(′) ends of genes is still incomplete, thus limiting the interpretation of the data generated. For example, we have previously reported two different genes, ADD2 and CPEB3, with conserved 3(′)UTR alternative isoforms not annotated in the current versions of Ensembl and RefSeq human databases. RESULTS: In order to evaluate the existence of other conserved 3(′) ends not annotated in these databases we have now used comparative genomics and transcriptomics across several vertebrate species. In general, we have observed that 3(′)UTR conservation is lost after the end of the mature transcript. Using this change in conservation before and after the 3(′) end of the mature transcripts we have shown that many conserved ends were still not annotated. In addition, we used orthologous transcripts to predict 3(′)UTR extensions and validated these predictions using total RNA sequencing data. Finally, we used this method to identify not annotated 3(′) ends in rats and dogs. As a result, we report several hundred novel 3(′)UTR extensions in rats and a few thousand in dogs. CONCLUSIONS: The methods presented here can efficiently facilitate the identification of not-yet-annotated conserved 3(′)UTR extensions. The application of these methods will increase the confidence of orthologous gene models across vertebrates.