Cargando…

Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)

BACKGROUND: Mithun (Bos frontalis), also called gayal, is an endangered bovine species, under the tribe bovini with 2n = 58 XX chromosome complements and reared under the tropical rain forests region of India, China, Myanmar, Bhutan and Bangladesh. However, the origin of this species is still disput...

Descripción completa

Detalles Bibliográficos
Autores principales: Mukherjee, Sabyasachi, Cai, Zexi, Mukherjee, Anupama, Longkumer, Imsusosang, Mech, Moonmoon, Vupru, Kezhavituo, Khate, Kobu, Rajkhowa, Chandan, Mitra, Abhijit, Guldbrandtsen, Bernt, Lund, Mogens Sandø, Sahana, Goutam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6664528/
https://www.ncbi.nlm.nih.gov/pubmed/31357931
http://dx.doi.org/10.1186/s12864-019-5980-y
_version_ 1783439902389567488
author Mukherjee, Sabyasachi
Cai, Zexi
Mukherjee, Anupama
Longkumer, Imsusosang
Mech, Moonmoon
Vupru, Kezhavituo
Khate, Kobu
Rajkhowa, Chandan
Mitra, Abhijit
Guldbrandtsen, Bernt
Lund, Mogens Sandø
Sahana, Goutam
author_facet Mukherjee, Sabyasachi
Cai, Zexi
Mukherjee, Anupama
Longkumer, Imsusosang
Mech, Moonmoon
Vupru, Kezhavituo
Khate, Kobu
Rajkhowa, Chandan
Mitra, Abhijit
Guldbrandtsen, Bernt
Lund, Mogens Sandø
Sahana, Goutam
author_sort Mukherjee, Sabyasachi
collection PubMed
description BACKGROUND: Mithun (Bos frontalis), also called gayal, is an endangered bovine species, under the tribe bovini with 2n = 58 XX chromosome complements and reared under the tropical rain forests region of India, China, Myanmar, Bhutan and Bangladesh. However, the origin of this species is still disputed and information on its genomic architecture is scanty so far. We trust that availability of its whole genome sequence data and assembly will greatly solve this problem and help to generate many information including phylogenetic status of mithun. Recently, the first genome assembly of gayal, mithun of Chinese origin, was published. However, an improved reference genome assembly would still benefit in understanding genetic variation in mithun populations reared under diverse geographical locations and for building a superior consensus assembly. We, therefore, performed deep sequencing of the genome of an adult female mithun from India, assembled and annotated its genome and performed extensive bioinformatic analyses to produce a superior de novo genome assembly of mithun. RESULTS: We generated ≈300 Gigabyte (Gb) raw reads from whole-genome deep sequencing platforms and assembled the sequence data using a hybrid assembly strategy to create a high quality de novo assembly of mithun with 96% recovered as per BUSCO analysis. The final genome assembly has a total length of 3.0 Gb, contains 5,015 scaffolds with an N50 value of 1 Mb. Repeat sequences constitute around 43.66% of the assembly. The genomic alignments between mithun to cattle showed that their genomes, as expected, are highly conserved. Gene annotation identified 28,044 protein-coding genes presented in mithun genome. The gene orthologous groups of mithun showed a high degree of similarity in comparison with other species, while fewer mithun specific coding sequences were found compared to those in cattle. CONCLUSION: Here we presented the first de novo draft genome assembly of Indian mithun having better coverage, less fragmented, better annotated, and constitutes a reasonably complete assembly compared to the previously published gayal genome. This comprehensive assembly unravelled the genomic architecture of mithun to a great extent and will provide a reference genome assembly to research community to elucidate the evolutionary history of mithun across its distinct geographical locations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5980-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6664528
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66645282019-08-05 Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis) Mukherjee, Sabyasachi Cai, Zexi Mukherjee, Anupama Longkumer, Imsusosang Mech, Moonmoon Vupru, Kezhavituo Khate, Kobu Rajkhowa, Chandan Mitra, Abhijit Guldbrandtsen, Bernt Lund, Mogens Sandø Sahana, Goutam BMC Genomics Research Article BACKGROUND: Mithun (Bos frontalis), also called gayal, is an endangered bovine species, under the tribe bovini with 2n = 58 XX chromosome complements and reared under the tropical rain forests region of India, China, Myanmar, Bhutan and Bangladesh. However, the origin of this species is still disputed and information on its genomic architecture is scanty so far. We trust that availability of its whole genome sequence data and assembly will greatly solve this problem and help to generate many information including phylogenetic status of mithun. Recently, the first genome assembly of gayal, mithun of Chinese origin, was published. However, an improved reference genome assembly would still benefit in understanding genetic variation in mithun populations reared under diverse geographical locations and for building a superior consensus assembly. We, therefore, performed deep sequencing of the genome of an adult female mithun from India, assembled and annotated its genome and performed extensive bioinformatic analyses to produce a superior de novo genome assembly of mithun. RESULTS: We generated ≈300 Gigabyte (Gb) raw reads from whole-genome deep sequencing platforms and assembled the sequence data using a hybrid assembly strategy to create a high quality de novo assembly of mithun with 96% recovered as per BUSCO analysis. The final genome assembly has a total length of 3.0 Gb, contains 5,015 scaffolds with an N50 value of 1 Mb. Repeat sequences constitute around 43.66% of the assembly. The genomic alignments between mithun to cattle showed that their genomes, as expected, are highly conserved. Gene annotation identified 28,044 protein-coding genes presented in mithun genome. The gene orthologous groups of mithun showed a high degree of similarity in comparison with other species, while fewer mithun specific coding sequences were found compared to those in cattle. CONCLUSION: Here we presented the first de novo draft genome assembly of Indian mithun having better coverage, less fragmented, better annotated, and constitutes a reasonably complete assembly compared to the previously published gayal genome. This comprehensive assembly unravelled the genomic architecture of mithun to a great extent and will provide a reference genome assembly to research community to elucidate the evolutionary history of mithun across its distinct geographical locations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5980-y) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-29 /pmc/articles/PMC6664528/ /pubmed/31357931 http://dx.doi.org/10.1186/s12864-019-5980-y Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Mukherjee, Sabyasachi
Cai, Zexi
Mukherjee, Anupama
Longkumer, Imsusosang
Mech, Moonmoon
Vupru, Kezhavituo
Khate, Kobu
Rajkhowa, Chandan
Mitra, Abhijit
Guldbrandtsen, Bernt
Lund, Mogens Sandø
Sahana, Goutam
Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)
title Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)
title_full Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)
title_fullStr Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)
title_full_unstemmed Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)
title_short Whole genome sequence and de novo assembly revealed genomic architecture of Indian Mithun (Bos frontalis)
title_sort whole genome sequence and de novo assembly revealed genomic architecture of indian mithun (bos frontalis)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6664528/
https://www.ncbi.nlm.nih.gov/pubmed/31357931
http://dx.doi.org/10.1186/s12864-019-5980-y
work_keys_str_mv AT mukherjeesabyasachi wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT caizexi wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT mukherjeeanupama wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT longkumerimsusosang wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT mechmoonmoon wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT vuprukezhavituo wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT khatekobu wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT rajkhowachandan wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT mitraabhijit wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT guldbrandtsenbernt wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT lundmogenssandø wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis
AT sahanagoutam wholegenomesequenceanddenovoassemblyrevealedgenomicarchitectureofindianmithunbosfrontalis