Cargando…
ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data
The introduction of high-throughput chromosome conformation capture (Hi-C) into metagenomics enables reconstructing high-quality metagenome-assembled genomes (MAGs) from microbial communities. Despite recent advances in recovering eukaryotic, bacterial, and archaeal genomes using Hi-C contact maps,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889337/ https://www.ncbi.nlm.nih.gov/pubmed/36720887 http://dx.doi.org/10.1038/s41467-023-35945-y |
_version_ | 1784880707726737408 |
---|---|
author | Du, Yuxuan Fuhrman, Jed A. Sun, Fengzhu |
author_facet | Du, Yuxuan Fuhrman, Jed A. Sun, Fengzhu |
author_sort | Du, Yuxuan |
collection | PubMed |
description | The introduction of high-throughput chromosome conformation capture (Hi-C) into metagenomics enables reconstructing high-quality metagenome-assembled genomes (MAGs) from microbial communities. Despite recent advances in recovering eukaryotic, bacterial, and archaeal genomes using Hi-C contact maps, few of Hi-C-based methods are designed to retrieve viral genomes. Here we introduce ViralCC, a publicly available tool to recover complete viral genomes and detect virus-host pairs using Hi-C data. Compared to other Hi-C-based methods, ViralCC leverages the virus-host proximity structure as a complementary information source for the Hi-C interactions. Using mock and real metagenomic Hi-C datasets from several different microbial ecosystems, including the human gut, cow fecal, and wastewater, we demonstrate that ViralCC outperforms existing Hi-C-based binning methods as well as state-of-the-art tools specifically dedicated to metagenomic viral binning. ViralCC can also reveal the taxonomic structure of viruses and virus-host pairs in microbial communities. When applied to a real wastewater metagenomic Hi-C dataset, ViralCC constructs a phage-host network, which is further validated using CRISPR spacer analyses. ViralCC is an open-source pipeline available at https://github.com/dyxstat/ViralCC. |
format | Online Article Text |
id | pubmed-9889337 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-98893372023-02-02 ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data Du, Yuxuan Fuhrman, Jed A. Sun, Fengzhu Nat Commun Article The introduction of high-throughput chromosome conformation capture (Hi-C) into metagenomics enables reconstructing high-quality metagenome-assembled genomes (MAGs) from microbial communities. Despite recent advances in recovering eukaryotic, bacterial, and archaeal genomes using Hi-C contact maps, few of Hi-C-based methods are designed to retrieve viral genomes. Here we introduce ViralCC, a publicly available tool to recover complete viral genomes and detect virus-host pairs using Hi-C data. Compared to other Hi-C-based methods, ViralCC leverages the virus-host proximity structure as a complementary information source for the Hi-C interactions. Using mock and real metagenomic Hi-C datasets from several different microbial ecosystems, including the human gut, cow fecal, and wastewater, we demonstrate that ViralCC outperforms existing Hi-C-based binning methods as well as state-of-the-art tools specifically dedicated to metagenomic viral binning. ViralCC can also reveal the taxonomic structure of viruses and virus-host pairs in microbial communities. When applied to a real wastewater metagenomic Hi-C dataset, ViralCC constructs a phage-host network, which is further validated using CRISPR spacer analyses. ViralCC is an open-source pipeline available at https://github.com/dyxstat/ViralCC. Nature Publishing Group UK 2023-01-31 /pmc/articles/PMC9889337/ /pubmed/36720887 http://dx.doi.org/10.1038/s41467-023-35945-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Du, Yuxuan Fuhrman, Jed A. Sun, Fengzhu ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data |
title | ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data |
title_full | ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data |
title_fullStr | ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data |
title_full_unstemmed | ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data |
title_short | ViralCC retrieves complete viral genomes and virus-host pairs from metagenomic Hi-C data |
title_sort | viralcc retrieves complete viral genomes and virus-host pairs from metagenomic hi-c data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889337/ https://www.ncbi.nlm.nih.gov/pubmed/36720887 http://dx.doi.org/10.1038/s41467-023-35945-y |
work_keys_str_mv | AT duyuxuan viralccretrievescompleteviralgenomesandvirushostpairsfrommetagenomichicdata AT fuhrmanjeda viralccretrievescompleteviralgenomesandvirushostpairsfrommetagenomichicdata AT sunfengzhu viralccretrievescompleteviralgenomesandvirushostpairsfrommetagenomichicdata |