Cargando…

High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome

Viruses that infect bacteria (phages) are increasingly recognized for their importance in diverse ecosystems but identifying and annotating them in large-scale sequence datasets is still challenging. Although efficient scalable virus identification tools are emerging, defining the exact ends (termin...

Descripción completa

Detalles Bibliográficos
Autores principales: Garneau, Julian R., Legrand, Véronique, Marbouty, Martial, Press, Maximilian O., Vik, Dean R., Fortier, Louis-Charles, Sullivan, Matthew B., Bikard, David, Monot, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8443750/
https://www.ncbi.nlm.nih.gov/pubmed/34526611
http://dx.doi.org/10.1038/s41598-021-97867-3
_version_ 1784568352069386240
author Garneau, Julian R.
Legrand, Véronique
Marbouty, Martial
Press, Maximilian O.
Vik, Dean R.
Fortier, Louis-Charles
Sullivan, Matthew B.
Bikard, David
Monot, Marc
author_facet Garneau, Julian R.
Legrand, Véronique
Marbouty, Martial
Press, Maximilian O.
Vik, Dean R.
Fortier, Louis-Charles
Sullivan, Matthew B.
Bikard, David
Monot, Marc
author_sort Garneau, Julian R.
collection PubMed
description Viruses that infect bacteria (phages) are increasingly recognized for their importance in diverse ecosystems but identifying and annotating them in large-scale sequence datasets is still challenging. Although efficient scalable virus identification tools are emerging, defining the exact ends (termini) of phage genomes is still particularly difficult. The proper identification of termini is crucial, as it helps in characterizing the packaging mechanism of bacteriophages and provides information on various aspects of phage biology. Here, we introduce PhageTermVirome (PTV) as a tool for the easy and rapid high-throughput determination of phage termini and packaging mechanisms using modern large-scale metagenomics datasets. We successfully tested the PTV algorithm on a mock virome dataset and then used it on two real virome datasets to achieve the rapid identification of more than 100 phage termini and packaging mechanisms, with just a few hours of computing time. Because PTV allows the identification of free fully formed viral particles (by recognition of termini present only in encapsidated DNA), it can also complement other virus identification softwares to predict the true viral origin of contigs in viral metagenomics datasets. PTV is a novel and unique tool for high-throughput characterization of phage genomes, including phage termini identification and characterization of genome packaging mechanisms. This software should help researchers better visualize, map and study the virosphere. PTV is freely available for downloading and installation at https://gitlab.pasteur.fr/vlegrand/ptv.
format Online
Article
Text
id pubmed-8443750
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84437502021-09-20 High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome Garneau, Julian R. Legrand, Véronique Marbouty, Martial Press, Maximilian O. Vik, Dean R. Fortier, Louis-Charles Sullivan, Matthew B. Bikard, David Monot, Marc Sci Rep Article Viruses that infect bacteria (phages) are increasingly recognized for their importance in diverse ecosystems but identifying and annotating them in large-scale sequence datasets is still challenging. Although efficient scalable virus identification tools are emerging, defining the exact ends (termini) of phage genomes is still particularly difficult. The proper identification of termini is crucial, as it helps in characterizing the packaging mechanism of bacteriophages and provides information on various aspects of phage biology. Here, we introduce PhageTermVirome (PTV) as a tool for the easy and rapid high-throughput determination of phage termini and packaging mechanisms using modern large-scale metagenomics datasets. We successfully tested the PTV algorithm on a mock virome dataset and then used it on two real virome datasets to achieve the rapid identification of more than 100 phage termini and packaging mechanisms, with just a few hours of computing time. Because PTV allows the identification of free fully formed viral particles (by recognition of termini present only in encapsidated DNA), it can also complement other virus identification softwares to predict the true viral origin of contigs in viral metagenomics datasets. PTV is a novel and unique tool for high-throughput characterization of phage genomes, including phage termini identification and characterization of genome packaging mechanisms. This software should help researchers better visualize, map and study the virosphere. PTV is freely available for downloading and installation at https://gitlab.pasteur.fr/vlegrand/ptv. Nature Publishing Group UK 2021-09-15 /pmc/articles/PMC8443750/ /pubmed/34526611 http://dx.doi.org/10.1038/s41598-021-97867-3 Text en © The Author(s) 2021, corrected publication 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Garneau, Julian R.
Legrand, Véronique
Marbouty, Martial
Press, Maximilian O.
Vik, Dean R.
Fortier, Louis-Charles
Sullivan, Matthew B.
Bikard, David
Monot, Marc
High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome
title High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome
title_full High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome
title_fullStr High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome
title_full_unstemmed High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome
title_short High-throughput identification of viral termini and packaging mechanisms in virome datasets using PhageTermVirome
title_sort high-throughput identification of viral termini and packaging mechanisms in virome datasets using phagetermvirome
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8443750/
https://www.ncbi.nlm.nih.gov/pubmed/34526611
http://dx.doi.org/10.1038/s41598-021-97867-3
work_keys_str_mv AT garneaujulianr highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT legrandveronique highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT marboutymartial highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT pressmaximiliano highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT vikdeanr highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT fortierlouischarles highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT sullivanmatthewb highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT bikarddavid highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome
AT monotmarc highthroughputidentificationofviralterminiandpackagingmechanismsinviromedatasetsusingphagetermvirome