First de novo whole genome sequencing and assembly of the bar-headed goose


Bibliographic Details
Main Authors: Wang, Wen, Wang, Fang, Hao, Rongkai, Wang, Aizhen, Sharshov, Kirill, Druzyaka, Alexey, Lancuo, Zhuoma, Shi, Yuetong, Feng, Shuo
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2020
Subjects: Genomics
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7144584/
https://www.ncbi.nlm.nih.gov/pubmed/32292659
http://dx.doi.org/10.7717/peerj.8914
author Wang, Wen
Wang, Fang
Hao, Rongkai
Wang, Aizhen
Sharshov, Kirill
Druzyaka, Alexey
Lancuo, Zhuoma
Shi, Yuetong
Feng, Shuo
collection PubMed
description BACKGROUND: The bar-headed goose (Anser indicus) mainly inhabits the plateau wetlands of Asia. As a specialized high-altitude species, bar-headed geese migrate between South and Central Asia, crossing the Himalayan mountains twice a year along the Central Asian Flyway. The physiological, biochemical and behavioral adaptations of bar-headed geese to high-altitude living and flying have attracted much interest. However, to date, no genome assembly has been publicly available for the bar-headed goose. METHODS: In this study, we present the first de novo whole genome sequencing and assembly of the bar-headed goose, along with gene prediction and annotation. RESULTS: 10X Genomics sequencing produced a total of 124 Gb of sequencing data, covering the estimated genome size of the bar-headed goose about 103 times (average coverage). The genome assembly comprised 10,528 scaffolds, with a total length of 1.143 Gb and a scaffold N50 of 10.09 Mb. Annotation of the bar-headed goose genome assembly identified a total of 102 Mb (8.9%) of repetitive sequences, 16,428 protein-coding genes, and 282 tRNAs. In total, we identified 63 expanded and 20 contracted gene families in the bar-headed goose compared with the other 15 vertebrates. We also performed a positive selection analysis between the bar-headed goose and a closely related low-altitude species, the swan goose (Anser cygnoides), to uncover the bar-headed goose's genetic adaptations to the Qinghai-Tibetan Plateau. CONCLUSION: We report the most complete genome sequence of the bar-headed goose currently available. Our assembly will provide a valuable resource for further studies of gene function in the bar-headed goose. The data will also be valuable for studies of the evolution, population genetics and high-altitude adaptations of bar-headed geese at the genomic level.
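The summary statistics quoted in the abstract can be related to one another with simple arithmetic. The following is a minimal sketch, not the authors' pipeline: the function names and the toy scaffold lengths are hypothetical, and only the figures 124 Gb, 103x, 102 Mb and 1.143 Gb come from the record. It shows the usual definition of scaffold N50, the genome size implied by the reported data volume and coverage, and the repeat fraction of the assembly.

```python
def n50(lengths):
    """Return the N50 of a list of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together contain at least half of the total assembled bases.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


def implied_genome_size_gb(sequenced_gb, coverage):
    """Genome size implied by data volume divided by average coverage."""
    return sequenced_gb / coverage


def repeat_fraction(repeat_mb, assembly_gb):
    """Fraction of the assembly annotated as repeats (1 Gb taken as 1000 Mb)."""
    return repeat_mb / (assembly_gb * 1000)


# Hypothetical toy scaffold lengths, for illustration only.
toy_scaffolds = [80, 70, 50, 40, 30, 20]
print("toy N50:", n50(toy_scaffolds))

# Figures taken from the abstract above.
print("implied genome size (Gb):", round(implied_genome_size_gb(124, 103), 3))
print("repeat share (%):", round(repeat_fraction(102, 1.143) * 100, 1))
```

Under these assumptions, 124 Gb at roughly 103x corresponds to an estimated genome size of about 1.2 Gb, slightly above the 1.143 Gb assembly length, and 102 Mb of repeats in a 1.143 Gb assembly matches the reported 8.9%.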
format Online
Article
Text
id pubmed-7144584
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-7144584 2020-04-14 PeerJ Genomics PeerJ Inc. 2020-04-06 /pmc/articles/PMC7144584/ /pubmed/32292659 http://dx.doi.org/10.7717/peerj.8914 Text en ©2020 Wang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
title First de novo whole genome sequencing and assembly of the bar-headed goose
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7144584/
https://www.ncbi.nlm.nih.gov/pubmed/32292659
http://dx.doi.org/10.7717/peerj.8914