Cargando…

Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study

Annotation using Gene Ontology (GO) terms is one of the most important ways in which biological information about specific gene products can be expressed in a searchable, computable form that may be compared across genomes and organisms. Because literature-based GO annotations are often used to prop...

Descripción completa

Detalles Bibliográficos
Autores principales: Costanzo, Maria C., Park, Julie, Balakrishnan, Rama, Cherry, J. Michael, Hong, Eurie L.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3067894/
https://www.ncbi.nlm.nih.gov/pubmed/21411447
http://dx.doi.org/10.1093/database/bar004
_version_ 1782201171461537792
author Costanzo, Maria C.
Park, Julie
Balakrishnan, Rama
Cherry, J. Michael
Hong, Eurie L.
author_facet Costanzo, Maria C.
Park, Julie
Balakrishnan, Rama
Cherry, J. Michael
Hong, Eurie L.
author_sort Costanzo, Maria C.
collection PubMed
description Annotation using Gene Ontology (GO) terms is one of the most important ways in which biological information about specific gene products can be expressed in a searchable, computable form that may be compared across genomes and organisms. Because literature-based GO annotations are often used to propagate functional predictions between related proteins, their accuracy is critically important. We present a strategy that employs a comparison of literature-based annotations with computational predictions to identify and prioritize genes whose annotations need review. Using this method, we show that comparison of manually assigned ‘unknown’ annotations in the Saccharomyces Genome Database (SGD) with InterPro-based predictions can identify annotations that need to be updated. A survey of literature-based annotations and computational predictions made by the Gene Ontology Annotation (GOA) project at the European Bioinformatics Institute (EBI) across several other databases shows that this comparison strategy could be used to maintain and improve the quality of GO annotations for other organisms besides yeast. The survey also shows that although GOA-assigned predictions are the most comprehensive source of functional information for many genomes, a large proportion of genes in a variety of different organisms entirely lack these predictions but do have manual annotations. This underscores the critical need for manually performed, literature-based curation to provide functional information about genes that are outside the scope of widely used computational methods. Thus, the combination of manual and computational methods is essential to provide the most accurate and complete functional annotation of a genome. Database URL: http://www.yeastgenome.org
format Text
id pubmed-3067894
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30678942011-03-30 Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study Costanzo, Maria C. Park, Julie Balakrishnan, Rama Cherry, J. Michael Hong, Eurie L. Database (Oxford) Original Article Annotation using Gene Ontology (GO) terms is one of the most important ways in which biological information about specific gene products can be expressed in a searchable, computable form that may be compared across genomes and organisms. Because literature-based GO annotations are often used to propagate functional predictions between related proteins, their accuracy is critically important. We present a strategy that employs a comparison of literature-based annotations with computational predictions to identify and prioritize genes whose annotations need review. Using this method, we show that comparison of manually assigned ‘unknown’ annotations in the Saccharomyces Genome Database (SGD) with InterPro-based predictions can identify annotations that need to be updated. A survey of literature-based annotations and computational predictions made by the Gene Ontology Annotation (GOA) project at the European Bioinformatics Institute (EBI) across several other databases shows that this comparison strategy could be used to maintain and improve the quality of GO annotations for other organisms besides yeast. The survey also shows that although GOA-assigned predictions are the most comprehensive source of functional information for many genomes, a large proportion of genes in a variety of different organisms entirely lack these predictions but do have manual annotations. This underscores the critical need for manually performed, literature-based curation to provide functional information about genes that are outside the scope of widely used computational methods. Thus, the combination of manual and computational methods is essential to provide the most accurate and complete functional annotation of a genome. Database URL: http://www.yeastgenome.org Oxford University Press 2011-03-15 /pmc/articles/PMC3067894/ /pubmed/21411447 http://dx.doi.org/10.1093/database/bar004 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Costanzo, Maria C.
Park, Julie
Balakrishnan, Rama
Cherry, J. Michael
Hong, Eurie L.
Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
title Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
title_full Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
title_fullStr Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
title_full_unstemmed Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
title_short Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study
title_sort using computational predictions to improve literature-based gene ontology annotations: a feasibility study
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3067894/
https://www.ncbi.nlm.nih.gov/pubmed/21411447
http://dx.doi.org/10.1093/database/bar004
work_keys_str_mv AT costanzomariac usingcomputationalpredictionstoimproveliteraturebasedgeneontologyannotationsafeasibilitystudy
AT parkjulie usingcomputationalpredictionstoimproveliteraturebasedgeneontologyannotationsafeasibilitystudy
AT balakrishnanrama usingcomputationalpredictionstoimproveliteraturebasedgeneontologyannotationsafeasibilitystudy
AT cherryjmichael usingcomputationalpredictionstoimproveliteraturebasedgeneontologyannotationsafeasibilitystudy
AT hongeuriel usingcomputationalpredictionstoimproveliteraturebasedgeneontologyannotationsafeasibilitystudy