An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts

Bibliographic Details
Main Authors: Hoeppner, Marc P., Lundquist, Andrew, Pirun, Mono, Meadows, Jennifer R. S., Zamani, Neda, Johnson, Jeremy, Sundström, Görel, Cook, April, FitzGerald, Michael G., Swofford, Ross, Mauceli, Evan, Moghadam, Behrooz Torabi, Greka, Anna, Alföldi, Jessica, Abouelleil, Amr, Aftuck, Lynne, Bessette, Daniel, Berlin, Aaron, Brown, Adam, Gearin, Gary, Lui, Annie, Macdonald, J. Pendexter, Priest, Margaret, Shea, Terrance, Turner-Maier, Jason, Zimmer, Andrew, Lander, Eric S., di Palma, Federica, Lindblad-Toh, Kerstin, Grabherr, Manfred G.
Format: Online Article Text
Language: English
Published: Public Library of Science 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953330/
https://www.ncbi.nlm.nih.gov/pubmed/24625832
http://dx.doi.org/10.1371/journal.pone.0091172
author Hoeppner, Marc P.
Lundquist, Andrew
Pirun, Mono
Meadows, Jennifer R. S.
Zamani, Neda
Johnson, Jeremy
Sundström, Görel
Cook, April
FitzGerald, Michael G.
Swofford, Ross
Mauceli, Evan
Moghadam, Behrooz Torabi
Greka, Anna
Alföldi, Jessica
Abouelleil, Amr
Aftuck, Lynne
Bessette, Daniel
Berlin, Aaron
Brown, Adam
Gearin, Gary
Lui, Annie
Macdonald, J. Pendexter
Priest, Margaret
Shea, Terrance
Turner-Maier, Jason
Zimmer, Andrew
Lander, Eric S.
di Palma, Federica
Lindblad-Toh, Kerstin
Grabherr, Manfred G.
author_sort Hoeppner, Marc P.
collection PubMed
description The domestic dog, Canis familiaris, is a well-established model system for mapping trait and disease loci. While the original draft sequence was of good quality, gaps were abundant, particularly in promoter regions of the genome, negatively impacting the annotation and study of candidate genes. Here, we present an improved genome build, canFam3.1, which includes 85 MB of novel sequence and now covers 99.8% of the euchromatic portion of the genome. We also present multiple RNA-Sequencing data sets from 10 different canine tissues to catalog ∼175,000 expressed loci. While about 90% of the coding genes previously annotated by EnsEMBL have measurable expression in at least one sample, the number of transcript isoforms detected by our data expands the EnsEMBL annotations by a factor of four. Syntenic comparison with the human genome revealed an additional ∼3,000 loci that are characterized as protein coding in human and were also expressed in the dog, suggesting that those were previously not annotated in the EnsEMBL canine gene set. In addition to ∼20,700 high-confidence protein coding loci, we found ∼4,600 antisense transcripts overlapping exons of protein coding genes; ∼7,200 intergenic multi-exon transcripts without coding potential, likely candidates for long intergenic non-coding RNAs (lincRNAs); and ∼11,000 transcripts that were reported by two different library construction methods but did not fit any of the above categories. Of the lincRNAs, about 6,000 have no annotated orthologs in human or mouse. Functional analysis of two novel transcripts with shRNA in a mouse kidney cell line altered cell morphology and motility. All in all, we provide a much-improved annotation of the canine genome and suggest regulatory functions for several of the novel non-coding transcripts.
format Online
Article
Text
id pubmed-3953330
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3953330 2014-03-18 PLoS One Research Article Public Library of Science 2014-03-13 /pmc/articles/PMC3953330/ /pubmed/24625832 http://dx.doi.org/10.1371/journal.pone.0091172 Text en © 2014 Hoeppner et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts
topic Research Article