Cargando…
Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9862649/ https://www.ncbi.nlm.nih.gov/pubmed/36680116 http://dx.doi.org/10.3390/v15010076 |
_version_ | 1784875141709168640 |
---|---|
author | Andrianjakarivony, Harilanto Felana Bettarel, Yvan Armougom, Fabrice Desnues, Christelle |
author_facet | Andrianjakarivony, Harilanto Felana Bettarel, Yvan Armougom, Fabrice Desnues, Christelle |
author_sort | Andrianjakarivony, Harilanto Felana |
collection | PubMed |
description | Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification and the investigation of phage–host interactions. This study aimed to take this knowledge further, using a viral metagenomic framework to decipher the composition and diversity of phage communities and to predict their bacterial hosts. To this end, we used water and sediment samples collected from seven sites with varying contamination levels in the Ebrié Lagoon in Abidjan, Ivory Coast. The bacterial communities were characterized using the 16S rRNA metabarcoding approach, and a framework was developed to investigate the virome datasets that: (1) identified phage contigs with VirSorter and VIBRANT; (2) classified these contigs with MetaPhinder using the phage database (taxonomic annotation); and (3) predicted the phages’ bacterial hosts with a machine learning-based tool: the Prokaryotic Virus-Host Predictor. The findings showed that the taxonomic profiles of phages and bacteria were specific to sediment or water samples. Phage sequences assigned to the Microviridae family were widespread in sediment samples, whereas phage sequences assigned to the Siphoviridae, Myoviridae and Podoviridae families were predominant in water samples. In terms of bacterial communities, the phyla Latescibacteria, Zixibacteria, Bacteroidetes, Acidobacteria, Calditrichaeota, Gemmatimonadetes, Cyanobacteria and Patescibacteria were most widespread in sediment samples, while the phyla Epsilonbacteraeota, Tenericutes, Margulisbacteria, Proteobacteria, Actinobacteria, Planctomycetes and Marinimicrobia were most prevalent in water samples. Significantly, the relative abundance of bacterial communities (at major phylum level) estimated by 16S rRNA metabarcoding and phage-host prediction were significantly similar. These results demonstrate the reliability of this novel approach for predicting the bacterial hosts of phages from shotgun metagenomic sequencing data. |
format | Online Article Text |
id | pubmed-9862649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-98626492023-01-22 Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing Andrianjakarivony, Harilanto Felana Bettarel, Yvan Armougom, Fabrice Desnues, Christelle Viruses Article Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification and the investigation of phage–host interactions. This study aimed to take this knowledge further, using a viral metagenomic framework to decipher the composition and diversity of phage communities and to predict their bacterial hosts. To this end, we used water and sediment samples collected from seven sites with varying contamination levels in the Ebrié Lagoon in Abidjan, Ivory Coast. The bacterial communities were characterized using the 16S rRNA metabarcoding approach, and a framework was developed to investigate the virome datasets that: (1) identified phage contigs with VirSorter and VIBRANT; (2) classified these contigs with MetaPhinder using the phage database (taxonomic annotation); and (3) predicted the phages’ bacterial hosts with a machine learning-based tool: the Prokaryotic Virus-Host Predictor. The findings showed that the taxonomic profiles of phages and bacteria were specific to sediment or water samples. Phage sequences assigned to the Microviridae family were widespread in sediment samples, whereas phage sequences assigned to the Siphoviridae, Myoviridae and Podoviridae families were predominant in water samples. In terms of bacterial communities, the phyla Latescibacteria, Zixibacteria, Bacteroidetes, Acidobacteria, Calditrichaeota, Gemmatimonadetes, Cyanobacteria and Patescibacteria were most widespread in sediment samples, while the phyla Epsilonbacteraeota, Tenericutes, Margulisbacteria, Proteobacteria, Actinobacteria, Planctomycetes and Marinimicrobia were most prevalent in water samples. Significantly, the relative abundance of bacterial communities (at major phylum level) estimated by 16S rRNA metabarcoding and phage-host prediction were significantly similar. These results demonstrate the reliability of this novel approach for predicting the bacterial hosts of phages from shotgun metagenomic sequencing data. MDPI 2022-12-27 /pmc/articles/PMC9862649/ /pubmed/36680116 http://dx.doi.org/10.3390/v15010076 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Andrianjakarivony, Harilanto Felana Bettarel, Yvan Armougom, Fabrice Desnues, Christelle Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing |
title | Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing |
title_full | Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing |
title_fullStr | Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing |
title_full_unstemmed | Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing |
title_short | Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing |
title_sort | phage-host prediction using a computational tool coupled with 16s rrna gene amplicon sequencing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9862649/ https://www.ncbi.nlm.nih.gov/pubmed/36680116 http://dx.doi.org/10.3390/v15010076 |
work_keys_str_mv | AT andrianjakarivonyharilantofelana phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing AT bettarelyvan phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing AT armougomfabrice phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing AT desnueschristelle phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing |