Cargando…
Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs
The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Mary Ann Liebert, Inc., publishers
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041506/ https://www.ncbi.nlm.nih.gov/pubmed/36147516 http://dx.doi.org/10.1089/phage.2020.0044 |
_version_ | 1784694543784869888 |
---|---|
author | Lazeroff, Matt Ryder, Geordie Harris, Sarah L. Tsourkas, Philippos K. |
author_facet | Lazeroff, Matt Ryder, Geordie Harris, Sarah L. Tsourkas, Philippos K. |
author_sort | Lazeroff, Matt |
collection | PubMed |
description | The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently outperform the others; thus, the choice of which program to use is not obvious. We present the Phage Commander application for rapid identification of bacteriophage genes using multiple gene identification programs. Phage Commander runs a bacteriophage genome sequence through nine gene identification programs (and an additional program for identification of tRNAs) and integrates the results within a single output table. Phage Commander also generates formatted output files for direct export to National Center for Biotechnology Information GenBank or genome visualization programs such as DNA Master. Users can select the threshold for which genes to export (genes identified by at least one program, genes identified by at least two programs, etc.). Phage Commander was benchmarked using eight high-quality bacteriophage genomes whose genes are backed by experimental data. Our results show that the most accurate annotations are obtained by exporting genes identified by at least two or three programs. Many groups opt to manually curate the annotations obtained from gene identification programs, and Phage Commander was designed to facilitate manual curation of genome annotations. Our benchmarking results show that manual curation does indeed produce more accurate annotations than any individual gene identification program. The authors thus recommend manually curating the output of Phage Commander to generate maximally accurate annotations. Phage Commander is currently being used in the corresponding author's bacteriophage genome annotation class and has reduced the labor cost and improved the quality of genome annotations. |
format | Online Article Text |
id | pubmed-9041506 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Mary Ann Liebert, Inc., publishers |
record_format | MEDLINE/PubMed |
spelling | pubmed-90415062022-09-21 Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs Lazeroff, Matt Ryder, Geordie Harris, Sarah L. Tsourkas, Philippos K. Phage (New Rochelle) Original Articles The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently outperform the others; thus, the choice of which program to use is not obvious. We present the Phage Commander application for rapid identification of bacteriophage genes using multiple gene identification programs. Phage Commander runs a bacteriophage genome sequence through nine gene identification programs (and an additional program for identification of tRNAs) and integrates the results within a single output table. Phage Commander also generates formatted output files for direct export to National Center for Biotechnology Information GenBank or genome visualization programs such as DNA Master. Users can select the threshold for which genes to export (genes identified by at least one program, genes identified by at least two programs, etc.). Phage Commander was benchmarked using eight high-quality bacteriophage genomes whose genes are backed by experimental data. Our results show that the most accurate annotations are obtained by exporting genes identified by at least two or three programs. Many groups opt to manually curate the annotations obtained from gene identification programs, and Phage Commander was designed to facilitate manual curation of genome annotations. Our benchmarking results show that manual curation does indeed produce more accurate annotations than any individual gene identification program. The authors thus recommend manually curating the output of Phage Commander to generate maximally accurate annotations. Phage Commander is currently being used in the corresponding author's bacteriophage genome annotation class and has reduced the labor cost and improved the quality of genome annotations. Mary Ann Liebert, Inc., publishers 2021-12-01 2021-12-16 /pmc/articles/PMC9041506/ /pubmed/36147516 http://dx.doi.org/10.1089/phage.2020.0044 Text en © Matt Lazeroff et al. 2021; Published by Mary Ann Liebert, Inc. https://creativecommons.org/licenses/by-nc/4.0/This Open Access article is distributed under the terms of the Creative Commons Attribution Noncommercial License [CC-BY-NC] (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and the source are cited. |
spellingShingle | Original Articles Lazeroff, Matt Ryder, Geordie Harris, Sarah L. Tsourkas, Philippos K. Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs |
title | Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs |
title_full | Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs |
title_fullStr | Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs |
title_full_unstemmed | Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs |
title_short | Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs |
title_sort | phage commander, an application for rapid gene identification in bacteriophage genomes using multiple programs |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9041506/ https://www.ncbi.nlm.nih.gov/pubmed/36147516 http://dx.doi.org/10.1089/phage.2020.0044 |
work_keys_str_mv | AT lazeroffmatt phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms AT rydergeordie phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms AT harrissarahl phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms AT tsourkasphilipposk phagecommanderanapplicationforrapidgeneidentificationinbacteriophagegenomesusingmultipleprograms |