Cargando…

A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology

The wide coverage and biological relevance of the Gene Ontology (GO), confirmed through its successful use in protein function prediction, have led to the growth in its popularity. In order to exploit the extent of biological knowledge that GO offers in describing genes or groups of genes, there is...

Descripción completa

Detalles Bibliográficos
Autores principales: Mazandu, Gaston K., Mulder, Nicola J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3361142/
https://www.ncbi.nlm.nih.gov/pubmed/22666244
http://dx.doi.org/10.1155/2012/975783
_version_ 1782234083903930368
author Mazandu, Gaston K.
Mulder, Nicola J.
author_facet Mazandu, Gaston K.
Mulder, Nicola J.
author_sort Mazandu, Gaston K.
collection PubMed
description The wide coverage and biological relevance of the Gene Ontology (GO), confirmed through its successful use in protein function prediction, have led to the growth in its popularity. In order to exploit the extent of biological knowledge that GO offers in describing genes or groups of genes, there is a need for an efficient, scalable similarity measure for GO terms and GO-annotated proteins. While several GO similarity measures exist, none adequately addresses all issues surrounding the design and usage of the ontology. We introduce a new metric for measuring the distance between two GO terms using the intrinsic topology of the GO-DAG, thus enabling the measurement of functional similarities between proteins based on their GO annotations. We assess the performance of this metric using a ROC analysis on human protein-protein interaction datasets and correlation coefficient analysis on the selected set of protein pairs from the CESSM online tool. This metric achieves good performance compared to the existing annotation-based GO measures. We used this new metric to assess functional similarity between orthologues, and show that it is effective at determining whether orthologues are annotated with similar functions and identifying cases where annotation is inconsistent between orthologues.
format Online
Article
Text
id pubmed-3361142
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-33611422012-06-04 A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology Mazandu, Gaston K. Mulder, Nicola J. Adv Bioinformatics Research Article The wide coverage and biological relevance of the Gene Ontology (GO), confirmed through its successful use in protein function prediction, have led to the growth in its popularity. In order to exploit the extent of biological knowledge that GO offers in describing genes or groups of genes, there is a need for an efficient, scalable similarity measure for GO terms and GO-annotated proteins. While several GO similarity measures exist, none adequately addresses all issues surrounding the design and usage of the ontology. We introduce a new metric for measuring the distance between two GO terms using the intrinsic topology of the GO-DAG, thus enabling the measurement of functional similarities between proteins based on their GO annotations. We assess the performance of this metric using a ROC analysis on human protein-protein interaction datasets and correlation coefficient analysis on the selected set of protein pairs from the CESSM online tool. This metric achieves good performance compared to the existing annotation-based GO measures. We used this new metric to assess functional similarity between orthologues, and show that it is effective at determining whether orthologues are annotated with similar functions and identifying cases where annotation is inconsistent between orthologues. Hindawi Publishing Corporation 2012 2012-05-15 /pmc/articles/PMC3361142/ /pubmed/22666244 http://dx.doi.org/10.1155/2012/975783 Text en Copyright © 2012 G. K. Mazandu and N. J. Mulder. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mazandu, Gaston K.
Mulder, Nicola J.
A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology
title A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology
title_full A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology
title_fullStr A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology
title_full_unstemmed A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology
title_short A Topology-Based Metric for Measuring Term Similarity in the Gene Ontology
title_sort topology-based metric for measuring term similarity in the gene ontology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3361142/
https://www.ncbi.nlm.nih.gov/pubmed/22666244
http://dx.doi.org/10.1155/2012/975783
work_keys_str_mv AT mazandugastonk atopologybasedmetricformeasuringtermsimilarityinthegeneontology
AT muldernicolaj atopologybasedmetricformeasuringtermsimilarityinthegeneontology
AT mazandugastonk topologybasedmetricformeasuringtermsimilarityinthegeneontology
AT muldernicolaj topologybasedmetricformeasuringtermsimilarityinthegeneontology