Cargando…

PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets

Obtaining meaningful viral information from large sequencing datasets presents unique challenges distinct from prokaryotic and eukaryotic sequencing efforts. The difficulties surrounding this issue can be ascribed in part to the genomic plasticity of viruses themselves as well as the scarcity of exi...

Descripción completa

Detalles Bibliográficos
Autores principales: Hatzopoulos, Thomas, Watkins, Siobhan C., Putonti, Catherine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5320576/
https://www.ncbi.nlm.nih.gov/pubmed/28348848
http://dx.doi.org/10.1099/mgen.0.000053
_version_ 1782509565240147968
author Hatzopoulos, Thomas
Watkins, Siobhan C.
Putonti, Catherine
author_facet Hatzopoulos, Thomas
Watkins, Siobhan C.
Putonti, Catherine
author_sort Hatzopoulos, Thomas
collection PubMed
description Obtaining meaningful viral information from large sequencing datasets presents unique challenges distinct from prokaryotic and eukaryotic sequencing efforts. The difficulties surrounding this issue can be ascribed in part to the genomic plasticity of viruses themselves as well as the scarcity of existing information in genomic databases. The open-source software PhagePhisher (http://www.putonti-lab.com/phagephisher) has been designed as a simple pipeline to extract relevant information from complex and mixed datasets, and will improve the examination of bacteriophages, viruses, and virally related sequences, in a range of environments. Key aspects of the software include speed and ease of use; PhagePhisher can be used with limited operator knowledge of bioinformatics on a standard workstation. As a proof-of-concept, PhagePhisher was successfully implemented with bacteria–virus mixed samples of varying complexity. Furthermore, viral signals within microbial metagenomic datasets were easily and quickly identified by PhagePhisher, including those from prophages as well as lysogenic phages, an important and often neglected aspect of examining phage populations in the environment. PhagePhisher resolves viral-related sequences which may be obscured by or imbedded in bacterial genomes.
format Online
Article
Text
id pubmed-5320576
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-53205762017-03-27 PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets Hatzopoulos, Thomas Watkins, Siobhan C. Putonti, Catherine Microb Genom Methods Paper Obtaining meaningful viral information from large sequencing datasets presents unique challenges distinct from prokaryotic and eukaryotic sequencing efforts. The difficulties surrounding this issue can be ascribed in part to the genomic plasticity of viruses themselves as well as the scarcity of existing information in genomic databases. The open-source software PhagePhisher (http://www.putonti-lab.com/phagephisher) has been designed as a simple pipeline to extract relevant information from complex and mixed datasets, and will improve the examination of bacteriophages, viruses, and virally related sequences, in a range of environments. Key aspects of the software include speed and ease of use; PhagePhisher can be used with limited operator knowledge of bioinformatics on a standard workstation. As a proof-of-concept, PhagePhisher was successfully implemented with bacteria–virus mixed samples of varying complexity. Furthermore, viral signals within microbial metagenomic datasets were easily and quickly identified by PhagePhisher, including those from prophages as well as lysogenic phages, an important and often neglected aspect of examining phage populations in the environment. PhagePhisher resolves viral-related sequences which may be obscured by or imbedded in bacterial genomes. Microbiology Society 2016-03-10 /pmc/articles/PMC5320576/ /pubmed/28348848 http://dx.doi.org/10.1099/mgen.0.000053 Text en © 2016 The Authors http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Methods Paper
Hatzopoulos, Thomas
Watkins, Siobhan C.
Putonti, Catherine
PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
title PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
title_full PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
title_fullStr PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
title_full_unstemmed PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
title_short PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
title_sort phagephisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets
topic Methods Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5320576/
https://www.ncbi.nlm.nih.gov/pubmed/28348848
http://dx.doi.org/10.1099/mgen.0.000053
work_keys_str_mv AT hatzopoulosthomas phagephisherapipelineforthediscoveryofcovertviralsequencesincomplexgenomicdatasets
AT watkinssiobhanc phagephisherapipelineforthediscoveryofcovertviralsequencesincomplexgenomicdatasets
AT putonticatherine phagephisherapipelineforthediscoveryofcovertviralsequencesincomplexgenomicdatasets