Cargando…

multiPhATE: bioinformatics pipeline for functional annotation of phage isolates

SUMMARY: To address the need for improved phage annotation tools that scale, we created an automated throughput annotation pipeline: multiple-genome Phage Annotation Toolkit and Evaluator (multiPhATE). multiPhATE is a throughput pipeline driver that invokes an annotation pipeline (PhATE) across a us...

Descripción completa

Detalles Bibliográficos
Autores principales: Ecale Zhou, Carol L, Malfatti, Stephanie, Kimbrel, Jeffrey, Philipson, Casandra, McNair, Katelyn, Hamilton, Theron, Edwards, Robert, Souza, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821344/
https://www.ncbi.nlm.nih.gov/pubmed/31086982
http://dx.doi.org/10.1093/bioinformatics/btz258
_version_ 1783464126128848896
author Ecale Zhou, Carol L
Malfatti, Stephanie
Kimbrel, Jeffrey
Philipson, Casandra
McNair, Katelyn
Hamilton, Theron
Edwards, Robert
Souza, Brian
author_facet Ecale Zhou, Carol L
Malfatti, Stephanie
Kimbrel, Jeffrey
Philipson, Casandra
McNair, Katelyn
Hamilton, Theron
Edwards, Robert
Souza, Brian
author_sort Ecale Zhou, Carol L
collection PubMed
description SUMMARY: To address the need for improved phage annotation tools that scale, we created an automated throughput annotation pipeline: multiple-genome Phage Annotation Toolkit and Evaluator (multiPhATE). multiPhATE is a throughput pipeline driver that invokes an annotation pipeline (PhATE) across a user-specified set of phage genomes. This tool incorporates a de novo phage gene calling algorithm and assigns putative functions to gene calls using protein-, virus- and phage-centric databases. multiPhATE’s modular construction allows the user to implement all or any portion of the analyses by acquiring local instances of the desired databases and specifying the desired analyses in a configuration file. We demonstrate multiPhATE by annotating two newly sequenced Yersinia pestis phage genomes. Within multiPhATE, the PhATE processing pipeline can be readily implemented across multiple processors, making it adaptable for throughput sequencing projects. Software documentation assists the user in configuring the system. AVAILABILITY AND IMPLEMENTATION: multiPhATE was implemented in Python 3.7, and runs as a command-line code under Linux or Unix. multiPhATE is freely available under an open-source BSD3 license from https://github.com/carolzhou/multiPhATE. Instructions for acquiring the databases and third-party codes used by multiPhATE are included in the distribution README file. Users may report bugs by submitting to the github issues page associated with the multiPhATE distribution. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6821344
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68213442019-11-04 multiPhATE: bioinformatics pipeline for functional annotation of phage isolates Ecale Zhou, Carol L Malfatti, Stephanie Kimbrel, Jeffrey Philipson, Casandra McNair, Katelyn Hamilton, Theron Edwards, Robert Souza, Brian Bioinformatics Applications Notes SUMMARY: To address the need for improved phage annotation tools that scale, we created an automated throughput annotation pipeline: multiple-genome Phage Annotation Toolkit and Evaluator (multiPhATE). multiPhATE is a throughput pipeline driver that invokes an annotation pipeline (PhATE) across a user-specified set of phage genomes. This tool incorporates a de novo phage gene calling algorithm and assigns putative functions to gene calls using protein-, virus- and phage-centric databases. multiPhATE’s modular construction allows the user to implement all or any portion of the analyses by acquiring local instances of the desired databases and specifying the desired analyses in a configuration file. We demonstrate multiPhATE by annotating two newly sequenced Yersinia pestis phage genomes. Within multiPhATE, the PhATE processing pipeline can be readily implemented across multiple processors, making it adaptable for throughput sequencing projects. Software documentation assists the user in configuring the system. AVAILABILITY AND IMPLEMENTATION: multiPhATE was implemented in Python 3.7, and runs as a command-line code under Linux or Unix. multiPhATE is freely available under an open-source BSD3 license from https://github.com/carolzhou/multiPhATE. Instructions for acquiring the databases and third-party codes used by multiPhATE are included in the distribution README file. Users may report bugs by submitting to the github issues page associated with the multiPhATE distribution. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-01 2019-05-14 /pmc/articles/PMC6821344/ /pubmed/31086982 http://dx.doi.org/10.1093/bioinformatics/btz258 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Ecale Zhou, Carol L
Malfatti, Stephanie
Kimbrel, Jeffrey
Philipson, Casandra
McNair, Katelyn
Hamilton, Theron
Edwards, Robert
Souza, Brian
multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
title multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
title_full multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
title_fullStr multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
title_full_unstemmed multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
title_short multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
title_sort multiphate: bioinformatics pipeline for functional annotation of phage isolates
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821344/
https://www.ncbi.nlm.nih.gov/pubmed/31086982
http://dx.doi.org/10.1093/bioinformatics/btz258
work_keys_str_mv AT ecalezhoucaroll multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT malfattistephanie multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT kimbreljeffrey multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT philipsoncasandra multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT mcnairkatelyn multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT hamiltontheron multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT edwardsrobert multiphatebioinformaticspipelineforfunctionalannotationofphageisolates
AT souzabrian multiphatebioinformaticspipelineforfunctionalannotationofphageisolates