Cargando…

A quick guide for student-driven community genome annotation

High quality gene models are necessary to expand the molecular and genetic tools available for a target organism, but these are available for only a handful of model organisms that have undergone extensive curation and experimental validation over the course of many years. The majority of gene model...

Descripción completa

Detalles Bibliográficos
Autores principales: Hosmani, Prashant S., Shippy, Teresa, Miller, Sherry, Benoit, Joshua B., Munoz-Torres, Monica, Flores-Gonzalez, Mirella, Mueller, Lukas A., Wiersma-Koch, Helen, D’Elia, Tom, Brown, Susan J., Saha, Surya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6447164/
https://www.ncbi.nlm.nih.gov/pubmed/30943207
http://dx.doi.org/10.1371/journal.pcbi.1006682
_version_ 1783408467109740544
author Hosmani, Prashant S.
Shippy, Teresa
Miller, Sherry
Benoit, Joshua B.
Munoz-Torres, Monica
Flores-Gonzalez, Mirella
Mueller, Lukas A.
Wiersma-Koch, Helen
D’Elia, Tom
Brown, Susan J.
Saha, Surya
author_facet Hosmani, Prashant S.
Shippy, Teresa
Miller, Sherry
Benoit, Joshua B.
Munoz-Torres, Monica
Flores-Gonzalez, Mirella
Mueller, Lukas A.
Wiersma-Koch, Helen
D’Elia, Tom
Brown, Susan J.
Saha, Surya
author_sort Hosmani, Prashant S.
collection PubMed
description High quality gene models are necessary to expand the molecular and genetic tools available for a target organism, but these are available for only a handful of model organisms that have undergone extensive curation and experimental validation over the course of many years. The majority of gene models present in biological databases today have been identified in draft genome assemblies using automated annotation pipelines that are frequently based on orthologs from distantly related model organisms and usually have minor or major errors. Manual curation is time consuming and often requires substantial expertise, but is instrumental in improving gene model structure and identification. Manual annotation may seem to be a daunting and cost-prohibitive task for small research communities but involving undergraduates in community genome annotation consortiums can be mutually beneficial for both education and improved genomic resources. We outline a workflow for efficient manual annotation driven by a team of primarily undergraduate annotators. This model can be scaled to large teams and includes quality control processes through incremental evaluation. Moreover, it gives students an opportunity to increase their understanding of genome biology and to participate in scientific research in collaboration with peers and senior researchers at multiple institutions.
format Online
Article
Text
id pubmed-6447164
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64471642019-04-17 A quick guide for student-driven community genome annotation Hosmani, Prashant S. Shippy, Teresa Miller, Sherry Benoit, Joshua B. Munoz-Torres, Monica Flores-Gonzalez, Mirella Mueller, Lukas A. Wiersma-Koch, Helen D’Elia, Tom Brown, Susan J. Saha, Surya PLoS Comput Biol Education High quality gene models are necessary to expand the molecular and genetic tools available for a target organism, but these are available for only a handful of model organisms that have undergone extensive curation and experimental validation over the course of many years. The majority of gene models present in biological databases today have been identified in draft genome assemblies using automated annotation pipelines that are frequently based on orthologs from distantly related model organisms and usually have minor or major errors. Manual curation is time consuming and often requires substantial expertise, but is instrumental in improving gene model structure and identification. Manual annotation may seem to be a daunting and cost-prohibitive task for small research communities but involving undergraduates in community genome annotation consortiums can be mutually beneficial for both education and improved genomic resources. We outline a workflow for efficient manual annotation driven by a team of primarily undergraduate annotators. This model can be scaled to large teams and includes quality control processes through incremental evaluation. Moreover, it gives students an opportunity to increase their understanding of genome biology and to participate in scientific research in collaboration with peers and senior researchers at multiple institutions. Public Library of Science 2019-04-03 /pmc/articles/PMC6447164/ /pubmed/30943207 http://dx.doi.org/10.1371/journal.pcbi.1006682 Text en © 2019 Hosmani et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Education
Hosmani, Prashant S.
Shippy, Teresa
Miller, Sherry
Benoit, Joshua B.
Munoz-Torres, Monica
Flores-Gonzalez, Mirella
Mueller, Lukas A.
Wiersma-Koch, Helen
D’Elia, Tom
Brown, Susan J.
Saha, Surya
A quick guide for student-driven community genome annotation
title A quick guide for student-driven community genome annotation
title_full A quick guide for student-driven community genome annotation
title_fullStr A quick guide for student-driven community genome annotation
title_full_unstemmed A quick guide for student-driven community genome annotation
title_short A quick guide for student-driven community genome annotation
title_sort quick guide for student-driven community genome annotation
topic Education
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6447164/
https://www.ncbi.nlm.nih.gov/pubmed/30943207
http://dx.doi.org/10.1371/journal.pcbi.1006682
work_keys_str_mv AT hosmaniprashants aquickguideforstudentdrivencommunitygenomeannotation
AT shippyteresa aquickguideforstudentdrivencommunitygenomeannotation
AT millersherry aquickguideforstudentdrivencommunitygenomeannotation
AT benoitjoshuab aquickguideforstudentdrivencommunitygenomeannotation
AT munoztorresmonica aquickguideforstudentdrivencommunitygenomeannotation
AT floresgonzalezmirella aquickguideforstudentdrivencommunitygenomeannotation
AT muellerlukasa aquickguideforstudentdrivencommunitygenomeannotation
AT wiersmakochhelen aquickguideforstudentdrivencommunitygenomeannotation
AT deliatom aquickguideforstudentdrivencommunitygenomeannotation
AT brownsusanj aquickguideforstudentdrivencommunitygenomeannotation
AT sahasurya aquickguideforstudentdrivencommunitygenomeannotation
AT hosmaniprashants quickguideforstudentdrivencommunitygenomeannotation
AT shippyteresa quickguideforstudentdrivencommunitygenomeannotation
AT millersherry quickguideforstudentdrivencommunitygenomeannotation
AT benoitjoshuab quickguideforstudentdrivencommunitygenomeannotation
AT munoztorresmonica quickguideforstudentdrivencommunitygenomeannotation
AT floresgonzalezmirella quickguideforstudentdrivencommunitygenomeannotation
AT muellerlukasa quickguideforstudentdrivencommunitygenomeannotation
AT wiersmakochhelen quickguideforstudentdrivencommunitygenomeannotation
AT deliatom quickguideforstudentdrivencommunitygenomeannotation
AT brownsusanj quickguideforstudentdrivencommunitygenomeannotation
AT sahasurya quickguideforstudentdrivencommunitygenomeannotation