Cargando…

Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica

Barrientosiimonas humi gen. nov., sp. nov. 39(T) is a rare actinobacteria strain isolated from the less explored extreme environment of the Antarctic soil. Here, we present the whole genome sequencing and annotation data from the high-quality draft genome of B. humi from Antarctica. The extracted ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Chong, Sin Yee, Azmi, Aida Azrina, Cheah, Yoke Kqueen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10590835/
https://www.ncbi.nlm.nih.gov/pubmed/37876741
http://dx.doi.org/10.1016/j.dib.2023.109657
_version_ 1785124085635743744
author Chong, Sin Yee
Azmi, Aida Azrina
Cheah, Yoke Kqueen
author_facet Chong, Sin Yee
Azmi, Aida Azrina
Cheah, Yoke Kqueen
author_sort Chong, Sin Yee
collection PubMed
description Barrientosiimonas humi gen. nov., sp. nov. 39(T) is a rare actinobacteria strain isolated from the less explored extreme environment of the Antarctic soil. Here, we present the whole genome sequencing and annotation data from the high-quality draft genome of B. humi from Antarctica. The extracted genomic deoxyribonucleic acid (DNA) was sequenced using the PacBio Sequel sequencing platform, followed by the Illumina HiSeq sequencing system. Subsequently, the assembly data from Canu 1.7 and Pilon were subjected to bioinformatics analysis for genome annotation to analyze the entire genomic information of the sequences. Different bioinformatics analysis approaches were used to disclose a high-quality draft genome basis for B. humi and provided a better understanding of its biological and molecular functions. Note that 83,639 reads were predicted from its 3.6Mb genome size, with a guanine-cytosine content (GC) content of 72.39%. The genome was assembled into two contigs, where the larger contig represents the chromosome and the smaller contig represents the plasmid. It is composed of 3,381 coding genes, with about 95% of them being functionally annotated. It consists of 3,318 coding sequences, one tmRNA gene, 57 tRNA genes, and five repeated regions. B. humi was evident, sharing a close sequence similarity with the species Demetria terragena and the family Dermacoccaceae. Gene Ontology (GO) functional classification indicated cell and cell parts were highly represented among the cellular component category; catalytic activity and binding were the most enriched processes within the molecular function category; metabolic and cellular processes were the most represented in the biological process category. Clusters of Orthologous Group (COG) functional classification revealed metabolism-related genes were highly enriched and mostly mapped to amino acid transport metabolism, transcription, energy production, and conversion. Moreover, the Kyoto Encyclopedia of Genes and Genomes (KEGG) functional classification reported that the metabolism process was the most represented KEGG pathway. There were 52 biosynthetic gene clusters involved in secondary metabolites biosynthesis, indicating B. humi has antibacterial, antifungal, cytotoxic, and inhibitor bioactivities. The dataset of the whole-genome sequence of B. humi has been deposited in the European Nucleotide Archive (ENA) repository under the accession number PRJEB44986 / ERP129097. The dataset of the genome annotation of B. humi had been deposited in Zenodo. The reported genomic sequence data for B. humi contributes comprehensive data to the current molecular information of the species, serving as a significant approach that facilitates the advancement of medicine.
format Online
Article
Text
id pubmed-10590835
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105908352023-10-24 Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica Chong, Sin Yee Azmi, Aida Azrina Cheah, Yoke Kqueen Data Brief Data Article Barrientosiimonas humi gen. nov., sp. nov. 39(T) is a rare actinobacteria strain isolated from the less explored extreme environment of the Antarctic soil. Here, we present the whole genome sequencing and annotation data from the high-quality draft genome of B. humi from Antarctica. The extracted genomic deoxyribonucleic acid (DNA) was sequenced using the PacBio Sequel sequencing platform, followed by the Illumina HiSeq sequencing system. Subsequently, the assembly data from Canu 1.7 and Pilon were subjected to bioinformatics analysis for genome annotation to analyze the entire genomic information of the sequences. Different bioinformatics analysis approaches were used to disclose a high-quality draft genome basis for B. humi and provided a better understanding of its biological and molecular functions. Note that 83,639 reads were predicted from its 3.6Mb genome size, with a guanine-cytosine content (GC) content of 72.39%. The genome was assembled into two contigs, where the larger contig represents the chromosome and the smaller contig represents the plasmid. It is composed of 3,381 coding genes, with about 95% of them being functionally annotated. It consists of 3,318 coding sequences, one tmRNA gene, 57 tRNA genes, and five repeated regions. B. humi was evident, sharing a close sequence similarity with the species Demetria terragena and the family Dermacoccaceae. Gene Ontology (GO) functional classification indicated cell and cell parts were highly represented among the cellular component category; catalytic activity and binding were the most enriched processes within the molecular function category; metabolic and cellular processes were the most represented in the biological process category. Clusters of Orthologous Group (COG) functional classification revealed metabolism-related genes were highly enriched and mostly mapped to amino acid transport metabolism, transcription, energy production, and conversion. Moreover, the Kyoto Encyclopedia of Genes and Genomes (KEGG) functional classification reported that the metabolism process was the most represented KEGG pathway. There were 52 biosynthetic gene clusters involved in secondary metabolites biosynthesis, indicating B. humi has antibacterial, antifungal, cytotoxic, and inhibitor bioactivities. The dataset of the whole-genome sequence of B. humi has been deposited in the European Nucleotide Archive (ENA) repository under the accession number PRJEB44986 / ERP129097. The dataset of the genome annotation of B. humi had been deposited in Zenodo. The reported genomic sequence data for B. humi contributes comprehensive data to the current molecular information of the species, serving as a significant approach that facilitates the advancement of medicine. Elsevier 2023-10-12 /pmc/articles/PMC10590835/ /pubmed/37876741 http://dx.doi.org/10.1016/j.dib.2023.109657 Text en © 2023 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Chong, Sin Yee
Azmi, Aida Azrina
Cheah, Yoke Kqueen
Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica
title Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica
title_full Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica
title_fullStr Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica
title_full_unstemmed Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica
title_short Whole genome sequence and annotation dataset of rare actinobacteria, Barrientosiimonas humi gen. nov., sp. nov. 39(T) from Antarctica
title_sort whole genome sequence and annotation dataset of rare actinobacteria, barrientosiimonas humi gen. nov., sp. nov. 39(t) from antarctica
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10590835/
https://www.ncbi.nlm.nih.gov/pubmed/37876741
http://dx.doi.org/10.1016/j.dib.2023.109657
work_keys_str_mv AT chongsinyee wholegenomesequenceandannotationdatasetofrareactinobacteriabarrientosiimonashumigennovspnov39tfromantarctica
AT azmiaidaazrina wholegenomesequenceandannotationdatasetofrareactinobacteriabarrientosiimonashumigennovspnov39tfromantarctica
AT cheahyokekqueen wholegenomesequenceandannotationdatasetofrareactinobacteriabarrientosiimonashumigennovspnov39tfromantarctica