Cargando…

Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing

The endangered Aral barbell Luciobarbus brachycephalus is endemic to the water systems of the Caspian Sea and Aral Sea. Given the scarcity of genetic data for the species, we present a draft assembly based on PacBio long read sequencing technology. Approximate 299.4 Gb of long reads representing 166...

Descripción completa

Detalles Bibliográficos
Autores principales: Geng, Longwu, Zou, Ming, Jiang, Haifeng, Meng, Minghui, Xu, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8489429/
https://www.ncbi.nlm.nih.gov/pubmed/34255058
http://dx.doi.org/10.1093/gbe/evab131
_version_ 1784578341302435840
author Geng, Longwu
Zou, Ming
Jiang, Haifeng
Meng, Minghui
Xu, Wei
author_facet Geng, Longwu
Zou, Ming
Jiang, Haifeng
Meng, Minghui
Xu, Wei
author_sort Geng, Longwu
collection PubMed
description The endangered Aral barbell Luciobarbus brachycephalus is endemic to the water systems of the Caspian Sea and Aral Sea. Given the scarcity of genetic data for the species, we present a draft assembly based on PacBio long read sequencing technology. Approximate 299.4 Gb of long reads representing 166X of the estimated genome size were generated, and the final assembly was composed of 653 contigs totaling approximately 1,698.3 Mb, with a contig N50 length of 4.5 Mb. A total of 807.6 Mb represented approximately 47.6% of the assembly and were identified as repeats. Fifty-four thousand and six hundred possible protein genes were predicted, among which 50,727, representing approximately 92.9%, could be annotated by at least one database. Evolutionary analysis showed that L. brachycephalus and Labeo rohita diverged by approximately 42.6 Mya, and the obvious expansion of gene families residing in the L. brachycephalus genome may be attributed to the specific whole genome duplication of the species. The first genome assembly of L. brachycephalus can not only provide a foundation for genetic conservation and molecular breeding of this species but also contribute to comparative analyses of genome biology and evolution within Cyprinidae.
format Online
Article
Text
id pubmed-8489429
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84894292021-10-05 Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing Geng, Longwu Zou, Ming Jiang, Haifeng Meng, Minghui Xu, Wei Genome Biol Evol Genome Report The endangered Aral barbell Luciobarbus brachycephalus is endemic to the water systems of the Caspian Sea and Aral Sea. Given the scarcity of genetic data for the species, we present a draft assembly based on PacBio long read sequencing technology. Approximate 299.4 Gb of long reads representing 166X of the estimated genome size were generated, and the final assembly was composed of 653 contigs totaling approximately 1,698.3 Mb, with a contig N50 length of 4.5 Mb. A total of 807.6 Mb represented approximately 47.6% of the assembly and were identified as repeats. Fifty-four thousand and six hundred possible protein genes were predicted, among which 50,727, representing approximately 92.9%, could be annotated by at least one database. Evolutionary analysis showed that L. brachycephalus and Labeo rohita diverged by approximately 42.6 Mya, and the obvious expansion of gene families residing in the L. brachycephalus genome may be attributed to the specific whole genome duplication of the species. The first genome assembly of L. brachycephalus can not only provide a foundation for genetic conservation and molecular breeding of this species but also contribute to comparative analyses of genome biology and evolution within Cyprinidae. Oxford University Press 2021-07-13 /pmc/articles/PMC8489429/ /pubmed/34255058 http://dx.doi.org/10.1093/gbe/evab131 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genome Report
Geng, Longwu
Zou, Ming
Jiang, Haifeng
Meng, Minghui
Xu, Wei
Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing
title Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing
title_full Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing
title_fullStr Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing
title_full_unstemmed Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing
title_short Draft genome assembly of the Aral barbell Luciobarbus brachycephalus using PacBio sequencing
title_sort draft genome assembly of the aral barbell luciobarbus brachycephalus using pacbio sequencing
topic Genome Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8489429/
https://www.ncbi.nlm.nih.gov/pubmed/34255058
http://dx.doi.org/10.1093/gbe/evab131
work_keys_str_mv AT genglongwu draftgenomeassemblyofthearalbarbellluciobarbusbrachycephalususingpacbiosequencing
AT zouming draftgenomeassemblyofthearalbarbellluciobarbusbrachycephalususingpacbiosequencing
AT jianghaifeng draftgenomeassemblyofthearalbarbellluciobarbusbrachycephalususingpacbiosequencing
AT mengminghui draftgenomeassemblyofthearalbarbellluciobarbusbrachycephalususingpacbiosequencing
AT xuwei draftgenomeassemblyofthearalbarbellluciobarbusbrachycephalususingpacbiosequencing