Annotation of the Protein Coding Regions of the Equine Genome

Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced mR...

Bibliographic Details
Main authors: Hestand, Matthew S., Kalbfleisch, Theodore S., Coleman, Stephen J., Zeng, Zheng, Liu, Jinze, Orlando, Ludovic, MacLeod, James N.
Format: Online Article Text
Language: English
Published: Public Library of Science 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481266/
https://www.ncbi.nlm.nih.gov/pubmed/26107351
http://dx.doi.org/10.1371/journal.pone.0124375
_version_ 1782378252302548992
author Hestand, Matthew S.
Kalbfleisch, Theodore S.
Coleman, Stephen J.
Zeng, Zheng
Liu, Jinze
Orlando, Ludovic
MacLeod, James N.
collection PubMed
description Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced mRNA from a pool of forty-three different tissues. From these, we derived the structures of 68,594 transcripts. In addition, we identified 301,829 positions with SNPs or small indels within these transcripts relative to EquCab2. Interestingly, 780 variants extend the open reading frame of the transcript and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross-species transcriptional and genomic comparisons.
format Online
Article
Text
id pubmed-4481266
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-4481266 2015-06-29 Annotation of the Protein Coding Regions of the Equine Genome PLoS One Research Article Public Library of Science 2015-06-24 /pmc/articles/PMC4481266/ /pubmed/26107351 http://dx.doi.org/10.1371/journal.pone.0124375 Text en © 2015 Hestand et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Annotation of the Protein Coding Regions of the Equine Genome
topic Research Article