Cargando…

Linking pangenomes and metagenomes: the Prochlorococcus metapangenome

Pangenomes offer detailed characterizations of core and accessory genes found in a set of closely related microbial genomes, generally by clustering genes based on sequence homology. In comparison, metagenomes facilitate highly resolved investigations of the relative distribution of microbial genome...

Descripción completa

Detalles Bibliográficos
Autores principales: Delmont, Tom O., Eren, A. Murat
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5804319/
https://www.ncbi.nlm.nih.gov/pubmed/29423345
http://dx.doi.org/10.7717/peerj.4320
_version_ 1783298824128692224
author Delmont, Tom O.
Eren, A. Murat
author_facet Delmont, Tom O.
Eren, A. Murat
author_sort Delmont, Tom O.
collection PubMed
description Pangenomes offer detailed characterizations of core and accessory genes found in a set of closely related microbial genomes, generally by clustering genes based on sequence homology. In comparison, metagenomes facilitate highly resolved investigations of the relative distribution of microbial genomes and individual genes across environments through read recruitment analyses. Combining these complementary approaches can yield unique insights into the functional basis of microbial niche partitioning and fitness, however, advanced software solutions are lacking. Here we present an integrated analysis and visualization strategy that provides an interactive and reproducible framework to generate pangenomes and to study them in conjunction with metagenomes. To investigate its utility, we applied this strategy to a Prochlorococcus pangenome in the context of a large-scale marine metagenomic survey. The resulting Prochlorococcus metapangenome revealed remarkable differential abundance patterns between very closely related isolates that belonged to the same phylogenetic cluster and that differed by only a small number of gene clusters in the pangenome. While the relationships between these genomes based on gene clusters correlated with their environmental distribution patterns, phylogenetic analyses using marker genes or concatenated single-copy core genes did not recapitulate these patterns. The metapangenome also revealed a small set of core genes that mostly occurred in hypervariable genomic islands of the Prochlorococcus populations, which systematically lacked read recruitment from surface ocean metagenomes. Notably, these core gene clusters were all linked to sugar metabolism, suggesting potential benefits to Prochlorococcus from a high sequence diversity of sugar metabolism genes. The rapidly growing number of microbial genomes and increasing availability of environmental metagenomes provide new opportunities to investigate the functioning and the ecology of microbial populations, and metapangenomes can provide unique insights for any taxon and biome for which genomic and sufficiently deep metagenomic data are available.
format Online
Article
Text
id pubmed-5804319
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-58043192018-02-08 Linking pangenomes and metagenomes: the Prochlorococcus metapangenome Delmont, Tom O. Eren, A. Murat PeerJ Bioinformatics Pangenomes offer detailed characterizations of core and accessory genes found in a set of closely related microbial genomes, generally by clustering genes based on sequence homology. In comparison, metagenomes facilitate highly resolved investigations of the relative distribution of microbial genomes and individual genes across environments through read recruitment analyses. Combining these complementary approaches can yield unique insights into the functional basis of microbial niche partitioning and fitness, however, advanced software solutions are lacking. Here we present an integrated analysis and visualization strategy that provides an interactive and reproducible framework to generate pangenomes and to study them in conjunction with metagenomes. To investigate its utility, we applied this strategy to a Prochlorococcus pangenome in the context of a large-scale marine metagenomic survey. The resulting Prochlorococcus metapangenome revealed remarkable differential abundance patterns between very closely related isolates that belonged to the same phylogenetic cluster and that differed by only a small number of gene clusters in the pangenome. While the relationships between these genomes based on gene clusters correlated with their environmental distribution patterns, phylogenetic analyses using marker genes or concatenated single-copy core genes did not recapitulate these patterns. The metapangenome also revealed a small set of core genes that mostly occurred in hypervariable genomic islands of the Prochlorococcus populations, which systematically lacked read recruitment from surface ocean metagenomes. Notably, these core gene clusters were all linked to sugar metabolism, suggesting potential benefits to Prochlorococcus from a high sequence diversity of sugar metabolism genes. The rapidly growing number of microbial genomes and increasing availability of environmental metagenomes provide new opportunities to investigate the functioning and the ecology of microbial populations, and metapangenomes can provide unique insights for any taxon and biome for which genomic and sufficiently deep metagenomic data are available. PeerJ Inc. 2018-01-25 /pmc/articles/PMC5804319/ /pubmed/29423345 http://dx.doi.org/10.7717/peerj.4320 Text en ©2018 Delmont and Eren http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Delmont, Tom O.
Eren, A. Murat
Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
title Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
title_full Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
title_fullStr Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
title_full_unstemmed Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
title_short Linking pangenomes and metagenomes: the Prochlorococcus metapangenome
title_sort linking pangenomes and metagenomes: the prochlorococcus metapangenome
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5804319/
https://www.ncbi.nlm.nih.gov/pubmed/29423345
http://dx.doi.org/10.7717/peerj.4320
work_keys_str_mv AT delmonttomo linkingpangenomesandmetagenomestheprochlorococcusmetapangenome
AT erenamurat linkingpangenomesandmetagenomestheprochlorococcusmetapangenome