Cargando…

First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)

The sago palm (Metroxylon sagu Rottboll) is a tropical halophytic starch-producing, economically important crop palm mainly located in Southeast Asian countries. Recently, a genome survey was conducted on this palm using the Illumina sequencing platform, with a very low (21.5%) BUSCO genome complete...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, Leonard Whye Kit, Lau, Melinda Mei Lin, Chung, Hung Hui, Hussain, Hasnain, Gan, Han Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760474/
https://www.ncbi.nlm.nih.gov/pubmed/35059482
http://dx.doi.org/10.1016/j.dib.2022.107800
_version_ 1784633327805792256
author Lim, Leonard Whye Kit
Lau, Melinda Mei Lin
Chung, Hung Hui
Hussain, Hasnain
Gan, Han Ming
author_facet Lim, Leonard Whye Kit
Lau, Melinda Mei Lin
Chung, Hung Hui
Hussain, Hasnain
Gan, Han Ming
author_sort Lim, Leonard Whye Kit
collection PubMed
description The sago palm (Metroxylon sagu Rottboll) is a tropical halophytic starch-producing, economically important crop palm mainly located in Southeast Asian countries. Recently, a genome survey was conducted on this palm using the Illumina sequencing platform, with a very low (21.5%) BUSCO genome completeness score, and most of them (∼78%) are either fragmented or missing. Thus, in this study, the sago palm genome completeness was further improved with the utilization of the Nanopore sequencing platform that produced longer reads. A hybrid genome assembly was conducted, and the outcome was a much complete sago palm genome with BUSCO completeness achieved at as high as 97.9%, with only ∼2% of them either fragmented or missing. The estimated genome size of the sago palm is 509,812,790 bp in this study. A sum of 33,242 protein-coding genes was revealed from the sago palm genome and around 96.39% of them had been functionally annotated. An investigation on the carbohydrate metabolism KEGG pathways also unearthed that starch synthesis was one of the major sago palm activities. The genome data obtained from this work is indispensable for future molecular evolutionary and genome-wide association studies on the economically important sago palm.
format Online
Article
Text
id pubmed-8760474
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-87604742022-01-19 First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll) Lim, Leonard Whye Kit Lau, Melinda Mei Lin Chung, Hung Hui Hussain, Hasnain Gan, Han Ming Data Brief Data Article The sago palm (Metroxylon sagu Rottboll) is a tropical halophytic starch-producing, economically important crop palm mainly located in Southeast Asian countries. Recently, a genome survey was conducted on this palm using the Illumina sequencing platform, with a very low (21.5%) BUSCO genome completeness score, and most of them (∼78%) are either fragmented or missing. Thus, in this study, the sago palm genome completeness was further improved with the utilization of the Nanopore sequencing platform that produced longer reads. A hybrid genome assembly was conducted, and the outcome was a much complete sago palm genome with BUSCO completeness achieved at as high as 97.9%, with only ∼2% of them either fragmented or missing. The estimated genome size of the sago palm is 509,812,790 bp in this study. A sum of 33,242 protein-coding genes was revealed from the sago palm genome and around 96.39% of them had been functionally annotated. An investigation on the carbohydrate metabolism KEGG pathways also unearthed that starch synthesis was one of the major sago palm activities. The genome data obtained from this work is indispensable for future molecular evolutionary and genome-wide association studies on the economically important sago palm. Elsevier 2022-01-06 /pmc/articles/PMC8760474/ /pubmed/35059482 http://dx.doi.org/10.1016/j.dib.2022.107800 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Lim, Leonard Whye Kit
Lau, Melinda Mei Lin
Chung, Hung Hui
Hussain, Hasnain
Gan, Han Ming
First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)
title First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)
title_full First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)
title_fullStr First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)
title_full_unstemmed First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)
title_short First high-quality genome assembly data of sago palm (Metroxylon sagu Rottboll)
title_sort first high-quality genome assembly data of sago palm (metroxylon sagu rottboll)
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760474/
https://www.ncbi.nlm.nih.gov/pubmed/35059482
http://dx.doi.org/10.1016/j.dib.2022.107800
work_keys_str_mv AT limleonardwhyekit firsthighqualitygenomeassemblydataofsagopalmmetroxylonsagurottboll
AT laumelindameilin firsthighqualitygenomeassemblydataofsagopalmmetroxylonsagurottboll
AT chunghunghui firsthighqualitygenomeassemblydataofsagopalmmetroxylonsagurottboll
AT hussainhasnain firsthighqualitygenomeassemblydataofsagopalmmetroxylonsagurottboll
AT ganhanming firsthighqualitygenomeassemblydataofsagopalmmetroxylonsagurottboll