Cargando…
A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation
Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6678273/ https://www.ncbi.nlm.nih.gov/pubmed/31295925 http://dx.doi.org/10.3390/ijms20143391 |
_version_ | 1783441061488623616 |
---|---|
author | Salisbury, Alicia Tsourkas, Philippos K. |
author_facet | Salisbury, Alicia Tsourkas, Philippos K. |
author_sort | Salisbury, Alicia |
collection | PubMed |
description | Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation are auto-annotation programs such as Glimmer and GeneMark. Due to the relatively small size of phage genomes, many groups choose to manually curate auto-annotation results to increase accuracy. An additional benefit of manual curation of auto-annotated phage genomes is that the process is amenable to be performed by students, and has been shown to improve student recruitment to the sciences. However, despite its greater accuracy and pedagogical value, manual curation suffers from high labor cost, lack of standardization and a degree of subjectivity in decision making, and susceptibility to mistakes. Here, we present a method developed in our lab that is designed to produce accurate annotations while reducing subjectivity and providing a degree of standardization in decision-making. We show that our method produces genome annotations more accurate than auto-annotation programs while retaining the pedagogical benefits of manual genome curation. |
format | Online Article Text |
id | pubmed-6678273 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-66782732019-08-19 A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation Salisbury, Alicia Tsourkas, Philippos K. Int J Mol Sci Article Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation are auto-annotation programs such as Glimmer and GeneMark. Due to the relatively small size of phage genomes, many groups choose to manually curate auto-annotation results to increase accuracy. An additional benefit of manual curation of auto-annotated phage genomes is that the process is amenable to be performed by students, and has been shown to improve student recruitment to the sciences. However, despite its greater accuracy and pedagogical value, manual curation suffers from high labor cost, lack of standardization and a degree of subjectivity in decision making, and susceptibility to mistakes. Here, we present a method developed in our lab that is designed to produce accurate annotations while reducing subjectivity and providing a degree of standardization in decision-making. We show that our method produces genome annotations more accurate than auto-annotation programs while retaining the pedagogical benefits of manual genome curation. MDPI 2019-07-10 /pmc/articles/PMC6678273/ /pubmed/31295925 http://dx.doi.org/10.3390/ijms20143391 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Salisbury, Alicia Tsourkas, Philippos K. A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation |
title | A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation |
title_full | A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation |
title_fullStr | A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation |
title_full_unstemmed | A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation |
title_short | A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation |
title_sort | method for improving the accuracy and efficiency of bacteriophage genome annotation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6678273/ https://www.ncbi.nlm.nih.gov/pubmed/31295925 http://dx.doi.org/10.3390/ijms20143391 |
work_keys_str_mv | AT salisburyalicia amethodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation AT tsourkasphilipposk amethodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation AT salisburyalicia methodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation AT tsourkasphilipposk methodforimprovingtheaccuracyandefficiencyofbacteriophagegenomeannotation |