A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation

Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation a...

Bibliographic Details
Main Authors: Salisbury, Alicia; Tsourkas, Philippos K.
Format: Online Article Text
Language: English
Published: MDPI 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6678273/
https://www.ncbi.nlm.nih.gov/pubmed/31295925
http://dx.doi.org/10.3390/ijms20143391
_version_ 1783441061488623616
author Salisbury, Alicia
Tsourkas, Philippos K.
author_facet Salisbury, Alicia
Tsourkas, Philippos K.
author_sort Salisbury, Alicia
collection PubMed
description Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation are auto-annotation programs such as Glimmer and GeneMark. Due to the relatively small size of phage genomes, many groups choose to manually curate auto-annotation results to increase accuracy. An additional benefit of manual curation of auto-annotated phage genomes is that the process can be performed by students, and has been shown to improve student recruitment to the sciences. However, despite its greater accuracy and pedagogical value, manual curation suffers from high labor cost, lack of standardization, a degree of subjectivity in decision-making, and susceptibility to mistakes. Here, we present a method developed in our lab that is designed to produce accurate annotations while reducing subjectivity and providing a degree of standardization in decision-making. We show that our method produces genome annotations more accurate than auto-annotation programs while retaining the pedagogical benefits of manual genome curation.
format Online
Article
Text
id pubmed-6678273
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-66782732019-08-19 A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation Salisbury, Alicia Tsourkas, Philippos K. Int J Mol Sci Article Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation are auto-annotation programs such as Glimmer and GeneMark. Due to the relatively small size of phage genomes, many groups choose to manually curate auto-annotation results to increase accuracy. An additional benefit of manual curation of auto-annotated phage genomes is that the process is amenable to be performed by students, and has been shown to improve student recruitment to the sciences. However, despite its greater accuracy and pedagogical value, manual curation suffers from high labor cost, lack of standardization and a degree of subjectivity in decision making, and susceptibility to mistakes. Here, we present a method developed in our lab that is designed to produce accurate annotations while reducing subjectivity and providing a degree of standardization in decision-making. We show that our method produces genome annotations more accurate than auto-annotation programs while retaining the pedagogical benefits of manual genome curation. MDPI 2019-07-10 /pmc/articles/PMC6678273/ /pubmed/31295925 http://dx.doi.org/10.3390/ijms20143391 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Salisbury, Alicia
Tsourkas, Philippos K.
A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
title A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
title_full A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
title_fullStr A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
title_full_unstemmed A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
title_short A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
title_sort method for improving the accuracy and efficiency of bacteriophage genome annotation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6678273/
https://www.ncbi.nlm.nih.gov/pubmed/31295925
http://dx.doi.org/10.3390/ijms20143391
work_keys_str_mv AT salisburyalicia amethodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation
AT tsourkasphilipposk amethodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation
AT salisburyalicia methodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation
AT tsourkasphilipposk methodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation