Cargando…

Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs

The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently...

Descripción completa

Detalles Bibliográficos
Autores principales: Lazeroff, Matt, Ryder, Geordie, Harris, Sarah L., Tsourkas, Philippos K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Mary Ann Liebert, Inc., publishers 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041506/
https://www.ncbi.nlm.nih.gov/pubmed/36147516
http://dx.doi.org/10.1089/phage.2020.0044
_version_ 1784694543784869888
author Lazeroff, Matt
Ryder, Geordie
Harris, Sarah L.
Tsourkas, Philippos K.
author_facet Lazeroff, Matt
Ryder, Geordie
Harris, Sarah L.
Tsourkas, Philippos K.
author_sort Lazeroff, Matt
collection PubMed
description The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently outperform the others; thus, the choice of which program to use is not obvious. We present the Phage Commander application for rapid identification of bacteriophage genes using multiple gene identification programs. Phage Commander runs a bacteriophage genome sequence through nine gene identification programs (and an additional program for identification of tRNAs) and integrates the results within a single output table. Phage Commander also generates formatted output files for direct export to National Center for Biotechnology Information GenBank or genome visualization programs such as DNA Master. Users can select the threshold for which genes to export (genes identified by at least one program, genes identified by at least two programs, etc.). Phage Commander was benchmarked using eight high-quality bacteriophage genomes whose genes are backed by experimental data. Our results show that the most accurate annotations are obtained by exporting genes identified by at least two or three programs. Many groups opt to manually curate the annotations obtained from gene identification programs, and Phage Commander was designed to facilitate manual curation of genome annotations. Our benchmarking results show that manual curation does indeed produce more accurate annotations than any individual gene identification program. The authors thus recommend manually curating the output of Phage Commander to generate maximally accurate annotations. Phage Commander is currently being used in the corresponding author's bacteriophage genome annotation class and has reduced the labor cost and improved the quality of genome annotations.
format Online
Article
Text
id pubmed-9041506
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Mary Ann Liebert, Inc., publishers
record_format MEDLINE/PubMed
spelling pubmed-90415062022-09-21 Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs Lazeroff, Matt Ryder, Geordie Harris, Sarah L. Tsourkas, Philippos K. Phage (New Rochelle) Original Articles The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently outperform the others; thus, the choice of which program to use is not obvious. We present the Phage Commander application for rapid identification of bacteriophage genes using multiple gene identification programs. Phage Commander runs a bacteriophage genome sequence through nine gene identification programs (and an additional program for identification of tRNAs) and integrates the results within a single output table. Phage Commander also generates formatted output files for direct export to National Center for Biotechnology Information GenBank or genome visualization programs such as DNA Master. Users can select the threshold for which genes to export (genes identified by at least one program, genes identified by at least two programs, etc.). Phage Commander was benchmarked using eight high-quality bacteriophage genomes whose genes are backed by experimental data. Our results show that the most accurate annotations are obtained by exporting genes identified by at least two or three programs. Many groups opt to manually curate the annotations obtained from gene identification programs, and Phage Commander was designed to facilitate manual curation of genome annotations. Our benchmarking results show that manual curation does indeed produce more accurate annotations than any individual gene identification program. The authors thus recommend manually curating the output of Phage Commander to generate maximally accurate annotations. Phage Commander is currently being used in the corresponding author's bacteriophage genome annotation class and has reduced the labor cost and improved the quality of genome annotations. Mary Ann Liebert, Inc., publishers 2021-12-01 2021-12-16 /pmc/articles/PMC9041506/ /pubmed/36147516 http://dx.doi.org/10.1089/phage.2020.0044 Text en © Matt Lazeroff et al. 2021; Published by Mary Ann Liebert, Inc. https://creativecommons.org/licenses/by-nc/4.0/This Open Access article is distributed under the terms of the Creative Commons Attribution Noncommercial License [CC-BY-NC] (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and the source are cited.
spellingShingle Original Articles
Lazeroff, Matt
Ryder, Geordie
Harris, Sarah L.
Tsourkas, Philippos K.
Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
title Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
title_full Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
title_fullStr Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
title_full_unstemmed Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
title_short Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
title_sort phage commander, an application for rapid gene identification in bacteriophage genomes using multiple programs
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041506/
https://www.ncbi.nlm.nih.gov/pubmed/36147516
http://dx.doi.org/10.1089/phage.2020.0044
work_keys_str_mv AT lazeroffmatt phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms
AT rydergeordie phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms
AT harrissarahl phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms
AT tsourkasphilipposk phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms