Cargando…

Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C

BACKGROUND: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptib...

Descripción completa

Detalles Bibliográficos
Autores principales: Field, Matt A, Rosen, Benjamin D, Dudchenko, Olga, Chan, Eva K F, Minoche, Andre E, Edwards, Richard J, Barton, Kirston, Lyons, Ruth J, Tuipulotu, Daniel Enosi, Hayes, Vanessa M, D. Omer, Arina, Colaric, Zane, Keilwagen, Jens, Skvortsova, Ksenia, Bogdanovic, Ozren, Smith, Martin A, Aiden, Erez Lieberman, Smith, Timothy P L, Zammit, Robert A, Ballard, J William O
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7111595/
https://www.ncbi.nlm.nih.gov/pubmed/32236524
http://dx.doi.org/10.1093/gigascience/giaa027
_version_ 1783513315040821248
author Field, Matt A
Rosen, Benjamin D
Dudchenko, Olga
Chan, Eva K F
Minoche, Andre E
Edwards, Richard J
Barton, Kirston
Lyons, Ruth J
Tuipulotu, Daniel Enosi
Hayes, Vanessa M
D. Omer, Arina
Colaric, Zane
Keilwagen, Jens
Skvortsova, Ksenia
Bogdanovic, Ozren
Smith, Martin A
Aiden, Erez Lieberman
Smith, Timothy P L
Zammit, Robert A
Ballard, J William O
author_facet Field, Matt A
Rosen, Benjamin D
Dudchenko, Olga
Chan, Eva K F
Minoche, Andre E
Edwards, Richard J
Barton, Kirston
Lyons, Ruth J
Tuipulotu, Daniel Enosi
Hayes, Vanessa M
D. Omer, Arina
Colaric, Zane
Keilwagen, Jens
Skvortsova, Ksenia
Bogdanovic, Ozren
Smith, Martin A
Aiden, Erez Lieberman
Smith, Timothy P L
Zammit, Robert A
Ballard, J William O
author_sort Field, Matt A
collection PubMed
description BACKGROUND: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. FINDINGS: Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. CONCLUSIONS: GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology.
format Online
Article
Text
id pubmed-7111595
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-71115952020-04-06 Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C Field, Matt A Rosen, Benjamin D Dudchenko, Olga Chan, Eva K F Minoche, Andre E Edwards, Richard J Barton, Kirston Lyons, Ruth J Tuipulotu, Daniel Enosi Hayes, Vanessa M D. Omer, Arina Colaric, Zane Keilwagen, Jens Skvortsova, Ksenia Bogdanovic, Ozren Smith, Martin A Aiden, Erez Lieberman Smith, Timothy P L Zammit, Robert A Ballard, J William O Gigascience Research BACKGROUND: The German Shepherd Dog (GSD) is one of the most common breeds on earth and has been bred for its utility and intelligence. It is often first choice for police and military work, as well as protection, disability assistance, and search-and-rescue. Yet, GSDs are well known to be susceptible to a range of genetic diseases that can interfere with their training. Such diseases are of particular concern when they occur later in life, and fully trained animals are not able to continue their duties. FINDINGS: Here, we provide the draft genome sequence of a healthy German Shepherd female as a reference for future disease and evolutionary studies. We generated this improved canid reference genome (CanFam_GSD) utilizing a combination of Pacific Bioscience, Oxford Nanopore, 10X Genomics, Bionano, and Hi-C technologies. The GSD assembly is ∼80 times as contiguous as the current canid reference genome (20.9 vs 0.267 Mb contig N50), containing far fewer gaps (306 vs 23,876) and fewer scaffolds (429 vs 3,310) than the current canid reference genome CanFamv3.1. Two chromosomes (4 and 35) are assembled into single scaffolds with no gaps. BUSCO analyses of the genome assembly results show that 93.0% of the conserved single-copy genes are complete in the GSD assembly compared with 92.2% for CanFam v3.1. Homology-based gene annotation increases this value to ∼99%. Detailed examination of the evolutionarily important pancreatic amylase region reveals that there are most likely 7 copies of the gene, indicative of a duplication of 4 ancestral copies and the disruption of 1 copy. CONCLUSIONS: GSD genome assembly and annotation were produced with major improvement in completeness, continuity, and quality over the existing canid reference. This resource will enable further research related to canine diseases, the evolutionary relationships of canids, and other aspects of canid biology. Oxford University Press 2020-04-01 /pmc/articles/PMC7111595/ /pubmed/32236524 http://dx.doi.org/10.1093/gigascience/giaa027 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Field, Matt A
Rosen, Benjamin D
Dudchenko, Olga
Chan, Eva K F
Minoche, Andre E
Edwards, Richard J
Barton, Kirston
Lyons, Ruth J
Tuipulotu, Daniel Enosi
Hayes, Vanessa M
D. Omer, Arina
Colaric, Zane
Keilwagen, Jens
Skvortsova, Ksenia
Bogdanovic, Ozren
Smith, Martin A
Aiden, Erez Lieberman
Smith, Timothy P L
Zammit, Robert A
Ballard, J William O
Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_full Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_fullStr Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_full_unstemmed Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_short Canfam_GSD: De novo chromosome-length genome assembly of the German Shepherd Dog (Canis lupus familiaris) using a combination of long reads, optical mapping, and Hi-C
title_sort canfam_gsd: de novo chromosome-length genome assembly of the german shepherd dog (canis lupus familiaris) using a combination of long reads, optical mapping, and hi-c
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7111595/
https://www.ncbi.nlm.nih.gov/pubmed/32236524
http://dx.doi.org/10.1093/gigascience/giaa027
work_keys_str_mv AT fieldmatta canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT rosenbenjamind canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT dudchenkoolga canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT chanevakf canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT minocheandree canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT edwardsrichardj canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT bartonkirston canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT lyonsruthj canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT tuipulotudanielenosi canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT hayesvanessam canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT domerarina canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT colariczane canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT keilwagenjens canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT skvortsovaksenia canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT bogdanovicozren canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT smithmartina canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT aidenerezlieberman canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT smithtimothypl canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT zammitroberta canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic
AT ballardjwilliamo canfamgsddenovochromosomelengthgenomeassemblyofthegermanshepherddogcanislupusfamiliarisusingacombinationoflongreadsopticalmappingandhic