
Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes


Bibliographic Details
Main authors: Aziz, Ramy K., Dwivedi, Bhakti, Akhter, Sajia, Breitbart, Mya, Edwards, Robert A.
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2015
Subjects: Microbiology
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4424905/
https://www.ncbi.nlm.nih.gov/pubmed/26005436
http://dx.doi.org/10.3389/fmicb.2015.00381
_version_ 1782370401851015168
author Aziz, Ramy K.
Dwivedi, Bhakti
Akhter, Sajia
Breitbart, Mya
Edwards, Robert A.
author_facet Aziz, Ramy K.
Dwivedi, Bhakti
Akhter, Sajia
Breitbart, Mya
Edwards, Robert A.
author_sort Aziz, Ramy K.
collection PubMed
description Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set of publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. We propose adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution.
format Online
Article
Text
id pubmed-4424905
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-44249052015-05-22 Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes Aziz, Ramy K. Dwivedi, Bhakti Akhter, Sajia Breitbart, Mya Edwards, Robert A. Front Microbiol Microbiology Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set of publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. We propose adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution. Frontiers Media S.A. 2015-05-08 /pmc/articles/PMC4424905/ /pubmed/26005436 http://dx.doi.org/10.3389/fmicb.2015.00381 Text en Copyright © 2015 Aziz, Dwivedi, Akhter, Breitbart and Edwards. 
http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Aziz, Ramy K.
Dwivedi, Bhakti
Akhter, Sajia
Breitbart, Mya
Edwards, Robert A.
Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
title Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
title_full Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
title_fullStr Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
title_full_unstemmed Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
title_short Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
title_sort multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4424905/
https://www.ncbi.nlm.nih.gov/pubmed/26005436
http://dx.doi.org/10.3389/fmicb.2015.00381
work_keys_str_mv AT azizramyk multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes
AT dwivedibhakti multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes
AT akhtersajia multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes
AT breitbartmya multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes
AT edwardsroberta multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes