Cargando…

An Educational Bioinformatics Project to Improve Genome Annotation

Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insuffici...

Descripción completa

Detalles Bibliográficos
Autores principales: Amatore, Zoie, Gunn, Susan, Harris, Laura K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7750189/
https://www.ncbi.nlm.nih.gov/pubmed/33365016
http://dx.doi.org/10.3389/fmicb.2020.577497
_version_ 1783625442768453632
author Amatore, Zoie
Gunn, Susan
Harris, Laura K.
author_facet Amatore, Zoie
Gunn, Susan
Harris, Laura K.
author_sort Amatore, Zoie
collection PubMed
description Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insufficient experimental evidence (i.e., indeterminate annotation). While automated annotation reviews help keep genome annotation up to date, often manual reviews are needed to verify proper annotation. Students can provide the manual review necessary to improve genome annotation. This paper outlines an innovative classroom project that determines if HPs have outdated or indeterminate annotation. The Hypothetical Protein Characterization Project uses multiple well-documented, freely available, web-based, bioinformatics resources that analyze an amino acid sequence to (1) detect sequence similarities to other proteins, (2) identify domains, (3) predict tertiary structure including active site characterization and potential binding ligands, and (4) determine cellular location. Enough evidence can be generated from these analyses to support re-annotation of HPs or prioritize HPs for experimental examinations such as structural determination via X-ray crystallography. Additionally, this paper details several approaches for selecting HPs to characterize using the Hypothetical Protein Characterization Project. These approaches include student- and instructor-directed random selection, selection using differential gene expression from mRNA expression data, and selection based on phylogenetic relations. This paper also provides additional resources to support instructional use of the Hypothetical Protein Characterization Project, such as example assignment instructions with grading rubrics, links to training videos in YouTube, and several step-by-step example projects to demonstrate and interpret the range of achievable results that students might encounter. Educational use of the Hypothetical Protein Characterization Project provides students with an opportunity to learn and apply knowledge of bioinformatic programs to address scientific questions. The project is highly customizable in that HP selection and analysis can be specifically formulated based on the scope and purpose of each student’s investigations. Programs used for HP analysis can be easily adapted to course learning objectives. The project can be used in both online and in-seat instruction for a wide variety of undergraduate and graduate classes as well as undergraduate capstone, honor’s, and experiential learning projects.
format Online
Article
Text
id pubmed-7750189
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77501892020-12-22 An Educational Bioinformatics Project to Improve Genome Annotation Amatore, Zoie Gunn, Susan Harris, Laura K. Front Microbiol Microbiology Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insufficient experimental evidence (i.e., indeterminate annotation). While automated annotation reviews help keep genome annotation up to date, often manual reviews are needed to verify proper annotation. Students can provide the manual review necessary to improve genome annotation. This paper outlines an innovative classroom project that determines if HPs have outdated or indeterminate annotation. The Hypothetical Protein Characterization Project uses multiple well-documented, freely available, web-based, bioinformatics resources that analyze an amino acid sequence to (1) detect sequence similarities to other proteins, (2) identify domains, (3) predict tertiary structure including active site characterization and potential binding ligands, and (4) determine cellular location. Enough evidence can be generated from these analyses to support re-annotation of HPs or prioritize HPs for experimental examinations such as structural determination via X-ray crystallography. Additionally, this paper details several approaches for selecting HPs to characterize using the Hypothetical Protein Characterization Project. These approaches include student- and instructor-directed random selection, selection using differential gene expression from mRNA expression data, and selection based on phylogenetic relations. This paper also provides additional resources to support instructional use of the Hypothetical Protein Characterization Project, such as example assignment instructions with grading rubrics, links to training videos in YouTube, and several step-by-step example projects to demonstrate and interpret the range of achievable results that students might encounter. Educational use of the Hypothetical Protein Characterization Project provides students with an opportunity to learn and apply knowledge of bioinformatic programs to address scientific questions. The project is highly customizable in that HP selection and analysis can be specifically formulated based on the scope and purpose of each student’s investigations. Programs used for HP analysis can be easily adapted to course learning objectives. The project can be used in both online and in-seat instruction for a wide variety of undergraduate and graduate classes as well as undergraduate capstone, honor’s, and experiential learning projects. Frontiers Media S.A. 2020-12-07 /pmc/articles/PMC7750189/ /pubmed/33365016 http://dx.doi.org/10.3389/fmicb.2020.577497 Text en Copyright © 2020 Amatore, Gunn and Harris. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Amatore, Zoie
Gunn, Susan
Harris, Laura K.
An Educational Bioinformatics Project to Improve Genome Annotation
title An Educational Bioinformatics Project to Improve Genome Annotation
title_full An Educational Bioinformatics Project to Improve Genome Annotation
title_fullStr An Educational Bioinformatics Project to Improve Genome Annotation
title_full_unstemmed An Educational Bioinformatics Project to Improve Genome Annotation
title_short An Educational Bioinformatics Project to Improve Genome Annotation
title_sort educational bioinformatics project to improve genome annotation
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7750189/
https://www.ncbi.nlm.nih.gov/pubmed/33365016
http://dx.doi.org/10.3389/fmicb.2020.577497
work_keys_str_mv AT amatorezoie aneducationalbioinformaticsprojecttoimprovegenomeannotation
AT gunnsusan aneducationalbioinformaticsprojecttoimprovegenomeannotation
AT harrislaurak aneducationalbioinformaticsprojecttoimprovegenomeannotation
AT amatorezoie educationalbioinformaticsprojecttoimprovegenomeannotation
AT gunnsusan educationalbioinformaticsprojecttoimprovegenomeannotation
AT harrislaurak educationalbioinformaticsprojecttoimprovegenomeannotation