Cargando…

Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems

Measuring microbial diversity is traditionally based on microbe taxonomy. Here, in contrast, we aimed to quantify heterogeneity in microbial gene content across 14,183 metagenomic samples spanning 17 ecologies, including 6 human associated, 7 nonhuman host associated, and 4 in other nonhuman host en...

Descripción completa

Detalles Bibliográficos
Autores principales: Zimmerman, Samuel, Tierney, Braden T., Patel, Chirag J., Kostic, Aleksandar D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134805/
https://www.ncbi.nlm.nih.gov/pubmed/37022232
http://dx.doi.org/10.1128/msystems.00118-23
_version_ 1785031834564820992
author Zimmerman, Samuel
Tierney, Braden T.
Patel, Chirag J.
Kostic, Aleksandar D.
author_facet Zimmerman, Samuel
Tierney, Braden T.
Patel, Chirag J.
Kostic, Aleksandar D.
author_sort Zimmerman, Samuel
collection PubMed
description Measuring microbial diversity is traditionally based on microbe taxonomy. Here, in contrast, we aimed to quantify heterogeneity in microbial gene content across 14,183 metagenomic samples spanning 17 ecologies, including 6 human associated, 7 nonhuman host associated, and 4 in other nonhuman host environments. In total, we identified 117,629,181 nonredundant genes. The vast majority of genes (66%) occurred in only one sample (i.e., “singletons”). In contrast, we found 1,864 sequences present in every metagenome, but not necessarily every bacterial genome. Additionally, we report data sets of other ecology-associated genes (e.g., abundant in only gut ecosystems) and simultaneously demonstrated that prior microbiome gene catalogs are both incomplete and inaccurately cluster microbial genetic life (e.g., at gene sequence identities that are too restrictive). We provide our results and the sets of environmentally differentiating genes described above at http://www.microbial-genes.bio. IMPORTANCE The amount of shared genetic elements has not been quantified between the human microbiome and other host- and non-host-associated microbiomes. Here, we made a gene catalog of 17 different microbial ecosystems and compared them. We show that most species shared between environment and human gut microbiomes are pathogens and that prior gene catalogs described as “nearly complete” are far from it. Additionally, over two-thirds of all genes only appear in a single sample, and only 1,864 genes (0.001%) are found in all types of metagenomes. These results highlight the large diversity between metagenomes and reveal a new, rare class of genes, those found in every type of metagenome, but not every microbial genome.
format Online
Article
Text
id pubmed-10134805
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-101348052023-04-28 Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems Zimmerman, Samuel Tierney, Braden T. Patel, Chirag J. Kostic, Aleksandar D. mSystems Research Article Measuring microbial diversity is traditionally based on microbe taxonomy. Here, in contrast, we aimed to quantify heterogeneity in microbial gene content across 14,183 metagenomic samples spanning 17 ecologies, including 6 human associated, 7 nonhuman host associated, and 4 in other nonhuman host environments. In total, we identified 117,629,181 nonredundant genes. The vast majority of genes (66%) occurred in only one sample (i.e., “singletons”). In contrast, we found 1,864 sequences present in every metagenome, but not necessarily every bacterial genome. Additionally, we report data sets of other ecology-associated genes (e.g., abundant in only gut ecosystems) and simultaneously demonstrated that prior microbiome gene catalogs are both incomplete and inaccurately cluster microbial genetic life (e.g., at gene sequence identities that are too restrictive). We provide our results and the sets of environmentally differentiating genes described above at http://www.microbial-genes.bio. IMPORTANCE The amount of shared genetic elements has not been quantified between the human microbiome and other host- and non-host-associated microbiomes. Here, we made a gene catalog of 17 different microbial ecosystems and compared them. We show that most species shared between environment and human gut microbiomes are pathogens and that prior gene catalogs described as “nearly complete” are far from it. Additionally, over two-thirds of all genes only appear in a single sample, and only 1,864 genes (0.001%) are found in all types of metagenomes. These results highlight the large diversity between metagenomes and reveal a new, rare class of genes, those found in every type of metagenome, but not every microbial genome. American Society for Microbiology 2023-04-06 /pmc/articles/PMC10134805/ /pubmed/37022232 http://dx.doi.org/10.1128/msystems.00118-23 Text en Copyright © 2023 Zimmerman et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Zimmerman, Samuel
Tierney, Braden T.
Patel, Chirag J.
Kostic, Aleksandar D.
Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems
title Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems
title_full Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems
title_fullStr Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems
title_full_unstemmed Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems
title_short Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems
title_sort quantifying shared and unique gene content across 17 microbial ecosystems
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10134805/
https://www.ncbi.nlm.nih.gov/pubmed/37022232
http://dx.doi.org/10.1128/msystems.00118-23
work_keys_str_mv AT zimmermansamuel quantifyingsharedanduniquegenecontentacross17microbialecosystems
AT tierneybradent quantifyingsharedanduniquegenecontentacross17microbialecosystems
AT patelchiragj quantifyingsharedanduniquegenecontentacross17microbialecosystems
AT kosticaleksandard quantifyingsharedanduniquegenecontentacross17microbialecosystems