Cargando…

An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data

BACKGROUND: The common marmoset (Callithrix jacchus) is one of the most studied primate model organisms. However, the marmoset genomes available in the public databases are highly fragmented and filled with sequence gaps, hindering research advances related to marmoset genomics and transcriptomics....

Descripción completa

Detalles Bibliográficos
Autores principales: Jayakumar, Vasanthan, Ishii, Hiromi, Seki, Misato, Kumita, Wakako, Inoue, Takashi, Hase, Sumitaka, Sato, Kengo, Okano, Hideyuki, Sasaki, Erika, Sakakibara, Yasubumi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7114785/
https://www.ncbi.nlm.nih.gov/pubmed/32241258
http://dx.doi.org/10.1186/s12864-020-6657-2
_version_ 1783513961106243584
author Jayakumar, Vasanthan
Ishii, Hiromi
Seki, Misato
Kumita, Wakako
Inoue, Takashi
Hase, Sumitaka
Sato, Kengo
Okano, Hideyuki
Sasaki, Erika
Sakakibara, Yasubumi
author_facet Jayakumar, Vasanthan
Ishii, Hiromi
Seki, Misato
Kumita, Wakako
Inoue, Takashi
Hase, Sumitaka
Sato, Kengo
Okano, Hideyuki
Sasaki, Erika
Sakakibara, Yasubumi
author_sort Jayakumar, Vasanthan
collection PubMed
description BACKGROUND: The common marmoset (Callithrix jacchus) is one of the most studied primate model organisms. However, the marmoset genomes available in the public databases are highly fragmented and filled with sequence gaps, hindering research advances related to marmoset genomics and transcriptomics. RESULTS: Here we utilize single-molecule, long-read sequence data to improve and update the existing genome assembly and report a near-complete genome of the common marmoset. The assembly is of 2.79 Gb size, with a contig N50 length of 6.37 Mb and a chromosomal scaffold N50 length of 143.91 Mb, representing the most contiguous and high-quality marmoset genome up to date. Approximately 90% of the assembled genome was represented in contigs longer than 1 Mb, with approximately 104-fold improvement in contiguity over the previously published marmoset genome. More than 98% of the gaps from the previously published genomes were filled successfully, which improved the mapping rates of genomic and transcriptomic data on to the assembled genome. CONCLUSIONS: Altogether the updated, high-quality common marmoset genome assembly provide improvements at various levels over the previous versions of the marmoset genome assemblies. This will allow researchers working on primate genomics to apply the genome more efficiently for their genomic and transcriptomic sequence data.
format Online
Article
Text
id pubmed-7114785
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71147852020-04-07 An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data Jayakumar, Vasanthan Ishii, Hiromi Seki, Misato Kumita, Wakako Inoue, Takashi Hase, Sumitaka Sato, Kengo Okano, Hideyuki Sasaki, Erika Sakakibara, Yasubumi BMC Genomics Research BACKGROUND: The common marmoset (Callithrix jacchus) is one of the most studied primate model organisms. However, the marmoset genomes available in the public databases are highly fragmented and filled with sequence gaps, hindering research advances related to marmoset genomics and transcriptomics. RESULTS: Here we utilize single-molecule, long-read sequence data to improve and update the existing genome assembly and report a near-complete genome of the common marmoset. The assembly is of 2.79 Gb size, with a contig N50 length of 6.37 Mb and a chromosomal scaffold N50 length of 143.91 Mb, representing the most contiguous and high-quality marmoset genome up to date. Approximately 90% of the assembled genome was represented in contigs longer than 1 Mb, with approximately 104-fold improvement in contiguity over the previously published marmoset genome. More than 98% of the gaps from the previously published genomes were filled successfully, which improved the mapping rates of genomic and transcriptomic data on to the assembled genome. CONCLUSIONS: Altogether the updated, high-quality common marmoset genome assembly provide improvements at various levels over the previous versions of the marmoset genome assemblies. This will allow researchers working on primate genomics to apply the genome more efficiently for their genomic and transcriptomic sequence data. BioMed Central 2020-04-02 /pmc/articles/PMC7114785/ /pubmed/32241258 http://dx.doi.org/10.1186/s12864-020-6657-2 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jayakumar, Vasanthan
Ishii, Hiromi
Seki, Misato
Kumita, Wakako
Inoue, Takashi
Hase, Sumitaka
Sato, Kengo
Okano, Hideyuki
Sasaki, Erika
Sakakibara, Yasubumi
An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
title An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
title_full An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
title_fullStr An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
title_full_unstemmed An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
title_short An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
title_sort improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7114785/
https://www.ncbi.nlm.nih.gov/pubmed/32241258
http://dx.doi.org/10.1186/s12864-020-6657-2
work_keys_str_mv AT jayakumarvasanthan animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT ishiihiromi animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT sekimisato animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT kumitawakako animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT inouetakashi animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT hasesumitaka animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT satokengo animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT okanohideyuki animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT sasakierika animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT sakakibarayasubumi animproveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT jayakumarvasanthan improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT ishiihiromi improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT sekimisato improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT kumitawakako improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT inouetakashi improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT hasesumitaka improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT satokengo improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT okanohideyuki improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT sasakierika improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata
AT sakakibarayasubumi improveddenovogenomeassemblyofthecommonmarmosetgenomeyieldsimprovedcontiguityandincreasedmappingratesofsequencedata