Cargando…

Estimate of the sequenced proportion of the global prokaryotic genome

BACKGROUND: Sequencing prokaryotic genomes has revolutionized our understanding of the many roles played by microorganisms. However, the cell and taxon proportions of genome-sequenced bacteria or archaea on earth remain unknown. This study aimed to explore this basic question using large-scale align...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zheng, Wang, Jianing, Wang, Jinlan, Wang, Jingjing, Li, Yuezhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7496214/
https://www.ncbi.nlm.nih.gov/pubmed/32938501
http://dx.doi.org/10.1186/s40168-020-00903-z
_version_ 1783583048898445312
author Zhang, Zheng
Wang, Jianing
Wang, Jinlan
Wang, Jingjing
Li, Yuezhong
author_facet Zhang, Zheng
Wang, Jianing
Wang, Jinlan
Wang, Jingjing
Li, Yuezhong
author_sort Zhang, Zheng
collection PubMed
description BACKGROUND: Sequencing prokaryotic genomes has revolutionized our understanding of the many roles played by microorganisms. However, the cell and taxon proportions of genome-sequenced bacteria or archaea on earth remain unknown. This study aimed to explore this basic question using large-scale alignment between the sequences released by the Earth Microbiome Project and 155,810 prokaryotic genomes from public databases. RESULTS: Our results showed that the median proportions of the genome-sequenced cells and taxa (at 100% identities in the 16S-V4 region) in different biomes reached 38.1% (16.4–86.3%) and 18.8% (9.1–52.6%), respectively. The sequenced proportions of the prokaryotic genomes in biomes were significantly negatively correlated with the alpha diversity indices, and the proportions sequenced in host-associated biomes were significantly higher than those in free-living biomes. Due to a set of cosmopolitan OTUs that are found in multiple samples and preferentially sequenced, only 2.1% of the global prokaryotic taxa are represented by sequenced genomes. Most of the biomes were occupied by a few predominant taxa with a high relative abundance and much higher genome-sequenced proportions than numerous rare taxa. CONCLUSIONS: These results reveal the current situation of prokaryotic genome sequencing for earth biomes, provide a more reasonable and efficient exploration of prokaryotic genomes, and promote our understanding of microbial ecological functions.
format Online
Article
Text
id pubmed-7496214
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74962142020-09-21 Estimate of the sequenced proportion of the global prokaryotic genome Zhang, Zheng Wang, Jianing Wang, Jinlan Wang, Jingjing Li, Yuezhong Microbiome Research BACKGROUND: Sequencing prokaryotic genomes has revolutionized our understanding of the many roles played by microorganisms. However, the cell and taxon proportions of genome-sequenced bacteria or archaea on earth remain unknown. This study aimed to explore this basic question using large-scale alignment between the sequences released by the Earth Microbiome Project and 155,810 prokaryotic genomes from public databases. RESULTS: Our results showed that the median proportions of the genome-sequenced cells and taxa (at 100% identities in the 16S-V4 region) in different biomes reached 38.1% (16.4–86.3%) and 18.8% (9.1–52.6%), respectively. The sequenced proportions of the prokaryotic genomes in biomes were significantly negatively correlated with the alpha diversity indices, and the proportions sequenced in host-associated biomes were significantly higher than those in free-living biomes. Due to a set of cosmopolitan OTUs that are found in multiple samples and preferentially sequenced, only 2.1% of the global prokaryotic taxa are represented by sequenced genomes. Most of the biomes were occupied by a few predominant taxa with a high relative abundance and much higher genome-sequenced proportions than numerous rare taxa. CONCLUSIONS: These results reveal the current situation of prokaryotic genome sequencing for earth biomes, provide a more reasonable and efficient exploration of prokaryotic genomes, and promote our understanding of microbial ecological functions. BioMed Central 2020-09-16 /pmc/articles/PMC7496214/ /pubmed/32938501 http://dx.doi.org/10.1186/s40168-020-00903-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhang, Zheng
Wang, Jianing
Wang, Jinlan
Wang, Jingjing
Li, Yuezhong
Estimate of the sequenced proportion of the global prokaryotic genome
title Estimate of the sequenced proportion of the global prokaryotic genome
title_full Estimate of the sequenced proportion of the global prokaryotic genome
title_fullStr Estimate of the sequenced proportion of the global prokaryotic genome
title_full_unstemmed Estimate of the sequenced proportion of the global prokaryotic genome
title_short Estimate of the sequenced proportion of the global prokaryotic genome
title_sort estimate of the sequenced proportion of the global prokaryotic genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7496214/
https://www.ncbi.nlm.nih.gov/pubmed/32938501
http://dx.doi.org/10.1186/s40168-020-00903-z
work_keys_str_mv AT zhangzheng estimateofthesequencedproportionoftheglobalprokaryoticgenome
AT wangjianing estimateofthesequencedproportionoftheglobalprokaryoticgenome
AT wangjinlan estimateofthesequencedproportionoftheglobalprokaryoticgenome
AT wangjingjing estimateofthesequencedproportionoftheglobalprokaryoticgenome
AT liyuezhong estimateofthesequencedproportionoftheglobalprokaryoticgenome