Cargando…

GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes

BACKGROUND: The function of a novel gene product is typically predicted by transitive assignment of annotation from similar sequences. We describe a novel method, GOtcha, for predicting gene product function by annotation with Gene Ontology (GO) terms. GOtcha predicts GO term associations with term-...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, David MA, Berriman, Matthew, Barton, Geoffrey J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC535938/
https://www.ncbi.nlm.nih.gov/pubmed/15550167
http://dx.doi.org/10.1186/1471-2105-5-178
_version_ 1782122039268605952
author Martin, David MA
Berriman, Matthew
Barton, Geoffrey J
author_facet Martin, David MA
Berriman, Matthew
Barton, Geoffrey J
author_sort Martin, David MA
collection PubMed
description BACKGROUND: The function of a novel gene product is typically predicted by transitive assignment of annotation from similar sequences. We describe a novel method, GOtcha, for predicting gene product function by annotation with Gene Ontology (GO) terms. GOtcha predicts GO term associations with term-specific probability (P-score) measures of confidence. Term-specific probabilities are a novel feature of GOtcha and allow the identification of conflicts or uncertainty in annotation. RESULTS: The GOtcha method was applied to the recently sequenced genome for Plasmodium falciparum and six other genomes. GOtcha was compared quantitatively for retrieval of assigned GO terms against direct transitive assignment from the highest scoring annotated BLAST search hit (TOPBLAST). GOtcha exploits information deep into the 'twilight zone' of similarity search matches, making use of much information that is otherwise discarded by more simplistic approaches. At a P-score cutoff of 50%, GOtcha provided 60% better recovery of annotation terms and 20% higher selectivity than annotation with TOPBLAST at an E-value cutoff of 10(-4). CONCLUSIONS: The GOtcha method is a useful tool for genome annotators. It has identified both errors and omissions in the original Plasmodium falciparum annotation and is being adopted by many other genome sequencing projects.
format Text
id pubmed-535938
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5359382004-12-18 GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes Martin, David MA Berriman, Matthew Barton, Geoffrey J BMC Bioinformatics Methodology Article BACKGROUND: The function of a novel gene product is typically predicted by transitive assignment of annotation from similar sequences. We describe a novel method, GOtcha, for predicting gene product function by annotation with Gene Ontology (GO) terms. GOtcha predicts GO term associations with term-specific probability (P-score) measures of confidence. Term-specific probabilities are a novel feature of GOtcha and allow the identification of conflicts or uncertainty in annotation. RESULTS: The GOtcha method was applied to the recently sequenced genome for Plasmodium falciparum and six other genomes. GOtcha was compared quantitatively for retrieval of assigned GO terms against direct transitive assignment from the highest scoring annotated BLAST search hit (TOPBLAST). GOtcha exploits information deep into the 'twilight zone' of similarity search matches, making use of much information that is otherwise discarded by more simplistic approaches. At a P-score cutoff of 50%, GOtcha provided 60% better recovery of annotation terms and 20% higher selectivity than annotation with TOPBLAST at an E-value cutoff of 10(-4). CONCLUSIONS: The GOtcha method is a useful tool for genome annotators. It has identified both errors and omissions in the original Plasmodium falciparum annotation and is being adopted by many other genome sequencing projects. BioMed Central 2004-11-18 /pmc/articles/PMC535938/ /pubmed/15550167 http://dx.doi.org/10.1186/1471-2105-5-178 Text en Copyright © 2004 Martin et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Martin, David MA
Berriman, Matthew
Barton, Geoffrey J
GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes
title GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes
title_full GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes
title_fullStr GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes
title_full_unstemmed GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes
title_short GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes
title_sort gotcha: a new method for prediction of protein function assessed by the annotation of seven genomes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC535938/
https://www.ncbi.nlm.nih.gov/pubmed/15550167
http://dx.doi.org/10.1186/1471-2105-5-178
work_keys_str_mv AT martindavidma gotchaanewmethodforpredictionofproteinfunctionassessedbytheannotationofsevengenomes
AT berrimanmatthew gotchaanewmethodforpredictionofproteinfunctionassessedbytheannotationofsevengenomes
AT bartongeoffreyj gotchaanewmethodforpredictionofproteinfunctionassessedbytheannotationofsevengenomes