Cargando…

Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing

Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification...

Descripción completa

Detalles Bibliográficos
Autores principales: Andrianjakarivony, Harilanto Felana, Bettarel, Yvan, Armougom, Fabrice, Desnues, Christelle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9862649/
https://www.ncbi.nlm.nih.gov/pubmed/36680116
http://dx.doi.org/10.3390/v15010076
_version_ 1784875141709168640
author Andrianjakarivony, Harilanto Felana
Bettarel, Yvan
Armougom, Fabrice
Desnues, Christelle
author_facet Andrianjakarivony, Harilanto Felana
Bettarel, Yvan
Armougom, Fabrice
Desnues, Christelle
author_sort Andrianjakarivony, Harilanto Felana
collection PubMed
description Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification and the investigation of phage–host interactions. This study aimed to take this knowledge further, using a viral metagenomic framework to decipher the composition and diversity of phage communities and to predict their bacterial hosts. To this end, we used water and sediment samples collected from seven sites with varying contamination levels in the Ebrié Lagoon in Abidjan, Ivory Coast. The bacterial communities were characterized using the 16S rRNA metabarcoding approach, and a framework was developed to investigate the virome datasets that: (1) identified phage contigs with VirSorter and VIBRANT; (2) classified these contigs with MetaPhinder using the phage database (taxonomic annotation); and (3) predicted the phages’ bacterial hosts with a machine learning-based tool: the Prokaryotic Virus-Host Predictor. The findings showed that the taxonomic profiles of phages and bacteria were specific to sediment or water samples. Phage sequences assigned to the Microviridae family were widespread in sediment samples, whereas phage sequences assigned to the Siphoviridae, Myoviridae and Podoviridae families were predominant in water samples. In terms of bacterial communities, the phyla Latescibacteria, Zixibacteria, Bacteroidetes, Acidobacteria, Calditrichaeota, Gemmatimonadetes, Cyanobacteria and Patescibacteria were most widespread in sediment samples, while the phyla Epsilonbacteraeota, Tenericutes, Margulisbacteria, Proteobacteria, Actinobacteria, Planctomycetes and Marinimicrobia were most prevalent in water samples. Significantly, the relative abundance of bacterial communities (at major phylum level) estimated by 16S rRNA metabarcoding and phage-host prediction were significantly similar. These results demonstrate the reliability of this novel approach for predicting the bacterial hosts of phages from shotgun metagenomic sequencing data.
format Online
Article
Text
id pubmed-9862649
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98626492023-01-22 Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing Andrianjakarivony, Harilanto Felana Bettarel, Yvan Armougom, Fabrice Desnues, Christelle Viruses Article Metagenomics studies have revealed tremendous viral diversity in aquatic environments. Yet, while the genomic data they have provided is extensive, it is unannotated. For example, most phage sequences lack accurate information about their bacterial host, which prevents reliable phage identification and the investigation of phage–host interactions. This study aimed to take this knowledge further, using a viral metagenomic framework to decipher the composition and diversity of phage communities and to predict their bacterial hosts. To this end, we used water and sediment samples collected from seven sites with varying contamination levels in the Ebrié Lagoon in Abidjan, Ivory Coast. The bacterial communities were characterized using the 16S rRNA metabarcoding approach, and a framework was developed to investigate the virome datasets that: (1) identified phage contigs with VirSorter and VIBRANT; (2) classified these contigs with MetaPhinder using the phage database (taxonomic annotation); and (3) predicted the phages’ bacterial hosts with a machine learning-based tool: the Prokaryotic Virus-Host Predictor. The findings showed that the taxonomic profiles of phages and bacteria were specific to sediment or water samples. Phage sequences assigned to the Microviridae family were widespread in sediment samples, whereas phage sequences assigned to the Siphoviridae, Myoviridae and Podoviridae families were predominant in water samples. In terms of bacterial communities, the phyla Latescibacteria, Zixibacteria, Bacteroidetes, Acidobacteria, Calditrichaeota, Gemmatimonadetes, Cyanobacteria and Patescibacteria were most widespread in sediment samples, while the phyla Epsilonbacteraeota, Tenericutes, Margulisbacteria, Proteobacteria, Actinobacteria, Planctomycetes and Marinimicrobia were most prevalent in water samples. Significantly, the relative abundance of bacterial communities (at major phylum level) estimated by 16S rRNA metabarcoding and phage-host prediction were significantly similar. These results demonstrate the reliability of this novel approach for predicting the bacterial hosts of phages from shotgun metagenomic sequencing data. MDPI 2022-12-27 /pmc/articles/PMC9862649/ /pubmed/36680116 http://dx.doi.org/10.3390/v15010076 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Andrianjakarivony, Harilanto Felana
Bettarel, Yvan
Armougom, Fabrice
Desnues, Christelle
Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
title Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
title_full Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
title_fullStr Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
title_full_unstemmed Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
title_short Phage-Host Prediction Using a Computational Tool Coupled with 16S rRNA Gene Amplicon Sequencing
title_sort phage-host prediction using a computational tool coupled with 16s rrna gene amplicon sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9862649/
https://www.ncbi.nlm.nih.gov/pubmed/36680116
http://dx.doi.org/10.3390/v15010076
work_keys_str_mv AT andrianjakarivonyharilantofelana phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing
AT bettarelyvan phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing
AT armougomfabrice phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing
AT desnueschristelle phagehostpredictionusingacomputationaltoolcoupledwith16srrnageneampliconsequencing