Cargando…

The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly

Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder...

Descripción completa

Detalles Bibliográficos
Autores principales: Pinto, Brendan J., Gamble, Tony, Smith, Chase H., Keating, Shannon E., Havird, Justin C., Chiari, Ylenia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882329/
https://www.ncbi.nlm.nih.gov/pubmed/36712019
http://dx.doi.org/10.1101/2023.01.20.523807
_version_ 1784879275392892928
author Pinto, Brendan J.
Gamble, Tony
Smith, Chase H.
Keating, Shannon E.
Havird, Justin C.
Chiari, Ylenia
author_facet Pinto, Brendan J.
Gamble, Tony
Smith, Chase H.
Keating, Shannon E.
Havird, Justin C.
Chiari, Ylenia
author_sort Pinto, Brendan J.
collection PubMed
description Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified that 9 of the 19 chromosomes were assembled as single contigs, while the other 10 chromosomes were each scaffolded together from two or more contigs. We qualitatively identified that percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000. The genome version and its associated annotations are also available via this Figshare repository https://doi.org/10.6084/m9.figshare.20069273.
format Online
Article
Text
id pubmed-9882329
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98823292023-01-28 The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly Pinto, Brendan J. Gamble, Tony Smith, Chase H. Keating, Shannon E. Havird, Justin C. Chiari, Ylenia bioRxiv Article Genomic resources across squamate reptiles (lizards and snakes) have lagged behind other vertebrate systems and high-quality reference genomes remain scarce. Of the 23 chromosome-scale reference genomes across the order, only 12 of the ~60 squamate families are represented. Within geckos (infraorder Gekkota), a species-rich clade of lizards, chromosome-level genomes are exceptionally sparse representing only two of the seven extant families. Using the latest advances in genome sequencing and assembly methods, we generated one of the highest quality squamate genomes to date for the leopard gecko, Eublepharis macularius (Eublepharidae). We compared this assembly to the previous, short-read only, E. macularius reference genome published in 2016 and examined potential factors within the assembly influencing contiguity of genome assemblies using PacBio HiFi data. Briefly, the read N50 of the PacBio HiFi reads generated for this study was equal to the contig N50 of the previous E. macularius reference genome at 20.4 kilobases. The HiFi reads were assembled into a total of 132 contigs, which was further scaffolded using HiC data into 75 total sequences representing all 19 chromosomes. We identified that 9 of the 19 chromosomes were assembled as single contigs, while the other 10 chromosomes were each scaffolded together from two or more contigs. We qualitatively identified that percent repeat content within a chromosome broadly affects its assembly contiguity prior to scaffolding. This genome assembly signifies a new age for squamate genomics where high-quality reference genomes rivaling some of the best vertebrate genome assemblies can be generated for a fraction previous cost estimates. This new E. macularius reference assembly is available on NCBI at JAOPLA010000000. The genome version and its associated annotations are also available via this Figshare repository https://doi.org/10.6084/m9.figshare.20069273. Cold Spring Harbor Laboratory 2023-02-13 /pmc/articles/PMC9882329/ /pubmed/36712019 http://dx.doi.org/10.1101/2023.01.20.523807 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Pinto, Brendan J.
Gamble, Tony
Smith, Chase H.
Keating, Shannon E.
Havird, Justin C.
Chiari, Ylenia
The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly
title The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly
title_full The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly
title_fullStr The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly
title_full_unstemmed The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly
title_short The revised reference genome of the leopard gecko (Eublepharis macularius) provides insight into the considerations of genome phasing and assembly
title_sort revised reference genome of the leopard gecko (eublepharis macularius) provides insight into the considerations of genome phasing and assembly
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882329/
https://www.ncbi.nlm.nih.gov/pubmed/36712019
http://dx.doi.org/10.1101/2023.01.20.523807
work_keys_str_mv AT pintobrendanj therevisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT gambletony therevisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT smithchaseh therevisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT keatingshannone therevisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT havirdjustinc therevisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT chiariylenia therevisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT pintobrendanj revisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT gambletony revisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT smithchaseh revisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT keatingshannone revisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT havirdjustinc revisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly
AT chiariylenia revisedreferencegenomeoftheleopardgeckoeublepharismaculariusprovidesinsightintotheconsiderationsofgenomephasingandassembly