Cargando…
The use of informativity in the development of robust viromics-based examinations
Metagenomics-based studies have provided insight into many of the complex microbial communities responsible for maintaining life on this planet. Sequencing efforts often uncover novel genetic content; this is most evident for phage communities, in which upwards of 90% of all sequences exhibit no sim...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5417064/ https://www.ncbi.nlm.nih.gov/pubmed/28480148 http://dx.doi.org/10.7717/peerj.3281 |
_version_ | 1783233860406870016 |
---|---|
author | Watkins, Siobhan C. Putonti, Catherine |
author_facet | Watkins, Siobhan C. Putonti, Catherine |
author_sort | Watkins, Siobhan C. |
collection | PubMed |
description | Metagenomics-based studies have provided insight into many of the complex microbial communities responsible for maintaining life on this planet. Sequencing efforts often uncover novel genetic content; this is most evident for phage communities, in which upwards of 90% of all sequences exhibit no similarity to any sequence in current data repositories. For the small fraction that can be identified, the top BLAST hit is generally posited as being representative of a viral taxon present in the sample of origin. Homology-based classification, however, can be misleading as sequence repositories capture but a small fraction of phage diversity. Furthermore, lateral gene transfer is pervasive within phage communities. As such, the presence of a particular gene may not be indicative of the presence of a particular viral species. Rather, it is just that: an indication of the presence of a specific gene. To circumvent this limitation, we have developed a new method for the analysis of viral metagenomic datasets. BLAST hits are weighted, integrating the sequence identity and length of alignments as well as a taxonomic signal, such that each gene is evaluated with respect to its information content. Through this quantifiable metric, predictions of viral community structure can be made with confidence. As a proof-of-concept, the approach presented here was implemented and applied to seven freshwater viral metagenomes. While providing a robust method for evaluating viral metagenomic data, the tool is versatile and can easily be customized to investigations of any environment or biome. |
format | Online Article Text |
id | pubmed-5417064 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-54170642017-05-05 The use of informativity in the development of robust viromics-based examinations Watkins, Siobhan C. Putonti, Catherine PeerJ Bioinformatics Metagenomics-based studies have provided insight into many of the complex microbial communities responsible for maintaining life on this planet. Sequencing efforts often uncover novel genetic content; this is most evident for phage communities, in which upwards of 90% of all sequences exhibit no similarity to any sequence in current data repositories. For the small fraction that can be identified, the top BLAST hit is generally posited as being representative of a viral taxon present in the sample of origin. Homology-based classification, however, can be misleading as sequence repositories capture but a small fraction of phage diversity. Furthermore, lateral gene transfer is pervasive within phage communities. As such, the presence of a particular gene may not be indicative of the presence of a particular viral species. Rather, it is just that: an indication of the presence of a specific gene. To circumvent this limitation, we have developed a new method for the analysis of viral metagenomic datasets. BLAST hits are weighted, integrating the sequence identity and length of alignments as well as a taxonomic signal, such that each gene is evaluated with respect to its information content. Through this quantifiable metric, predictions of viral community structure can be made with confidence. As a proof-of-concept, the approach presented here was implemented and applied to seven freshwater viral metagenomes. While providing a robust method for evaluating viral metagenomic data, the tool is versatile and can easily be customized to investigations of any environment or biome. PeerJ Inc. 2017-05-02 /pmc/articles/PMC5417064/ /pubmed/28480148 http://dx.doi.org/10.7717/peerj.3281 Text en ©2017 Watkins and Putonti http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Watkins, Siobhan C. Putonti, Catherine The use of informativity in the development of robust viromics-based examinations |
title | The use of informativity in the development of robust viromics-based examinations |
title_full | The use of informativity in the development of robust viromics-based examinations |
title_fullStr | The use of informativity in the development of robust viromics-based examinations |
title_full_unstemmed | The use of informativity in the development of robust viromics-based examinations |
title_short | The use of informativity in the development of robust viromics-based examinations |
title_sort | use of informativity in the development of robust viromics-based examinations |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5417064/ https://www.ncbi.nlm.nih.gov/pubmed/28480148 http://dx.doi.org/10.7717/peerj.3281 |
work_keys_str_mv | AT watkinssiobhanc theuseofinformativityinthedevelopmentofrobustviromicsbasedexaminations AT putonticatherine theuseofinformativityinthedevelopmentofrobustviromicsbasedexaminations AT watkinssiobhanc useofinformativityinthedevelopmentofrobustviromicsbasedexaminations AT putonticatherine useofinformativityinthedevelopmentofrobustviromicsbasedexaminations |