Cargando…
Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life
BACKGROUND: Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing n...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4624368/ https://www.ncbi.nlm.nih.gov/pubmed/26502935 http://dx.doi.org/10.1186/s13062-015-0092-3 |
_version_ | 1782397804069519360 |
---|---|
author | Lopez, Philippe Halary, Sébastien Bapteste, Eric |
author_facet | Lopez, Philippe Halary, Sébastien Bapteste, Eric |
author_sort | Lopez, Philippe |
collection | PubMed |
description | BACKGROUND: Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. RESULTS: Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. CONCLUSIONS: Many genes families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered. REVIEWERS: This article was reviewed by Eugene Koonin, William Martin and James McInerney. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-015-0092-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4624368 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46243682015-10-29 Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life Lopez, Philippe Halary, Sébastien Bapteste, Eric Biol Direct Research BACKGROUND: Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. RESULTS: Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. CONCLUSIONS: Many genes families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered. REVIEWERS: This article was reviewed by Eugene Koonin, William Martin and James McInerney. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-015-0092-3) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-26 /pmc/articles/PMC4624368/ /pubmed/26502935 http://dx.doi.org/10.1186/s13062-015-0092-3 Text en © Lopez et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lopez, Philippe Halary, Sébastien Bapteste, Eric Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
title | Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
title_full | Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
title_fullStr | Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
title_full_unstemmed | Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
title_short | Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
title_sort | highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4624368/ https://www.ncbi.nlm.nih.gov/pubmed/26502935 http://dx.doi.org/10.1186/s13062-015-0092-3 |
work_keys_str_mv | AT lopezphilippe highlydivergentancientgenefamiliesinmetagenomicsamplesarecompatiblewithadditionaldivisionsoflife AT halarysebastien highlydivergentancientgenefamiliesinmetagenomicsamplesarecompatiblewithadditionaldivisionsoflife AT baptesteeric highlydivergentancientgenefamiliesinmetagenomicsamplesarecompatiblewithadditionaldivisionsoflife |