Cargando…

Annotation of the Protein Coding Regions of the Equine Genome

Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced mR...

Descripción completa

Detalles Bibliográficos
Autores principales: Hestand, Matthew S., Kalbfleisch, Theodore S., Coleman, Stephen J., Zeng, Zheng, Liu, Jinze, Orlando, Ludovic, MacLeod, James N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481266/
https://www.ncbi.nlm.nih.gov/pubmed/26107351
http://dx.doi.org/10.1371/journal.pone.0124375
Descripción
Sumario:Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced mRNA from a pool of forty-three different tissues. From these, we derived the structures of 68,594 transcripts. In addition, we identified 301,829 positions with SNPs or small indels within these transcripts relative to EquCab2. Interestingly, 780 variants extend the open reading frame of the transcript and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross-species transcriptional and genomic comparisons.