Cargando…

Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission

BACKGROUND: One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have dev...

Descripción completa

Detalles Bibliográficos
Autores principales: Geib, Scott M, Hall, Brian, Derego, Theodore, Bremer, Forest T, Cannoles, Kyle, Sim, Sheina B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5887294/
https://www.ncbi.nlm.nih.gov/pubmed/29635297
http://dx.doi.org/10.1093/gigascience/giy018
_version_ 1783312268991135744
author Geib, Scott M
Hall, Brian
Derego, Theodore
Bremer, Forest T
Cannoles, Kyle
Sim, Sheina B
author_facet Geib, Scott M
Hall, Brian
Derego, Theodore
Bremer, Forest T
Cannoles, Kyle
Sim, Sheina B
author_sort Geib, Scott M
collection PubMed
description BACKGROUND: One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have developed software tools for post-annotation assembly filtering, annotation, and conversion into the NCBI’s annotation table format, these tools typically require back-end setup and connection to an Structured Query Language (SQL) database and/or some knowledge of programming (Perl, Python) to implement. With WGS becoming commonplace, genome sequencing projects are moving away from the genome centers and into the ecology or biology lab, where fewer resources are present to support the process of genome assembly curation. To fill this gap, we developed software to assess, filter, and transfer annotation and convert a draft genome assembly and annotation set into the NCBI annotation table (.tbl) format, facilitating submission to the NCBI Genome Assembly database. This software has no dependencies, is compatible across platforms, and utilizes a simple command to perform a variety of simple and complex post-analysis, pre-NCBI submission WGS project tasks. FINDINGS: The Genome Annotation Generator is a consistent and user-friendly bioinformatics tool that can be used to generate a .tbl file that is consistent with the NCBI submission pipeline CONCLUSIONS: The Genome Annotation Generator achieves the goal of providing a publicly available tool that will facilitate the submission of annotated genome assemblies to the NCBI. It is useful for any individual researcher or research group that wishes to submit a genome assembly of their study system to the NCBI.
format Online
Article
Text
id pubmed-5887294
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58872942018-04-11 Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission Geib, Scott M Hall, Brian Derego, Theodore Bremer, Forest T Cannoles, Kyle Sim, Sheina B Gigascience Technical Note BACKGROUND: One of the most overlooked, yet critical, components of a whole genome sequencing (WGS) project is the submission and curation of the data to a genomic repository, most commonly the National Center for Biotechnology Information (NCBI). While large genome centers or genome groups have developed software tools for post-annotation assembly filtering, annotation, and conversion into the NCBI’s annotation table format, these tools typically require back-end setup and connection to an Structured Query Language (SQL) database and/or some knowledge of programming (Perl, Python) to implement. With WGS becoming commonplace, genome sequencing projects are moving away from the genome centers and into the ecology or biology lab, where fewer resources are present to support the process of genome assembly curation. To fill this gap, we developed software to assess, filter, and transfer annotation and convert a draft genome assembly and annotation set into the NCBI annotation table (.tbl) format, facilitating submission to the NCBI Genome Assembly database. This software has no dependencies, is compatible across platforms, and utilizes a simple command to perform a variety of simple and complex post-analysis, pre-NCBI submission WGS project tasks. FINDINGS: The Genome Annotation Generator is a consistent and user-friendly bioinformatics tool that can be used to generate a .tbl file that is consistent with the NCBI submission pipeline CONCLUSIONS: The Genome Annotation Generator achieves the goal of providing a publicly available tool that will facilitate the submission of annotated genome assemblies to the NCBI. It is useful for any individual researcher or research group that wishes to submit a genome assembly of their study system to the NCBI. Oxford University Press 2018-03-04 /pmc/articles/PMC5887294/ /pubmed/29635297 http://dx.doi.org/10.1093/gigascience/giy018 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Geib, Scott M
Hall, Brian
Derego, Theodore
Bremer, Forest T
Cannoles, Kyle
Sim, Sheina B
Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission
title Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission
title_full Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission
title_fullStr Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission
title_full_unstemmed Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission
title_short Genome Annotation Generator: a simple tool for generating and correcting WGS annotation tables for NCBI submission
title_sort genome annotation generator: a simple tool for generating and correcting wgs annotation tables for ncbi submission
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5887294/
https://www.ncbi.nlm.nih.gov/pubmed/29635297
http://dx.doi.org/10.1093/gigascience/giy018
work_keys_str_mv AT geibscottm genomeannotationgeneratorasimpletoolforgeneratingandcorrectingwgsannotationtablesforncbisubmission
AT hallbrian genomeannotationgeneratorasimpletoolforgeneratingandcorrectingwgsannotationtablesforncbisubmission
AT deregotheodore genomeannotationgeneratorasimpletoolforgeneratingandcorrectingwgsannotationtablesforncbisubmission
AT bremerforestt genomeannotationgeneratorasimpletoolforgeneratingandcorrectingwgsannotationtablesforncbisubmission
AT cannoleskyle genomeannotationgeneratorasimpletoolforgeneratingandcorrectingwgsannotationtablesforncbisubmission
AT simsheinab genomeannotationgeneratorasimpletoolforgeneratingandcorrectingwgsannotationtablesforncbisubmission