Cargando…
Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes
Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage e...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4424905/ https://www.ncbi.nlm.nih.gov/pubmed/26005436 http://dx.doi.org/10.3389/fmicb.2015.00381 |
_version_ | 1782370401851015168 |
---|---|
author | Aziz, Ramy K. Dwivedi, Bhakti Akhter, Sajia Breitbart, Mya Edwards, Robert A. |
author_facet | Aziz, Ramy K. Dwivedi, Bhakti Akhter, Sajia Breitbart, Mya Edwards, Robert A. |
author_sort | Aziz, Ramy K. |
collection | PubMed |
description | Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set of publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. We propose adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution. |
format | Online Article Text |
id | pubmed-4424905 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-44249052015-05-22 Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes Aziz, Ramy K. Dwivedi, Bhakti Akhter, Sajia Breitbart, Mya Edwards, Robert A. Front Microbiol Microbiology Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set of publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. We propose adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution. Frontiers Media S.A. 2015-05-08 /pmc/articles/PMC4424905/ /pubmed/26005436 http://dx.doi.org/10.3389/fmicb.2015.00381 Text en Copyright © 2015 Aziz, Dwivedi, Akhter, Breitbart and Edwards. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Aziz, Ramy K. Dwivedi, Bhakti Akhter, Sajia Breitbart, Mya Edwards, Robert A. Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
title | Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
title_full | Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
title_fullStr | Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
title_full_unstemmed | Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
title_short | Multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
title_sort | multidimensional metrics for estimating phage abundance, distribution, gene density, and sequence coverage in metagenomes |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4424905/ https://www.ncbi.nlm.nih.gov/pubmed/26005436 http://dx.doi.org/10.3389/fmicb.2015.00381 |
work_keys_str_mv | AT azizramyk multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes AT dwivedibhakti multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes AT akhtersajia multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes AT breitbartmya multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes AT edwardsroberta multidimensionalmetricsforestimatingphageabundancedistributiongenedensityandsequencecoverageinmetagenomes |