Cargando…

Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory

Several approaches have been proposed for computing term information content (IC) and semantic similarity scores within the gene ontology (GO) directed acyclic graph (DAG). These approaches contributed to improving protein analyses at the functional level. Considering the recent proliferation of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Mazandu, Gaston K., Mulder, Nicola J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3775452/
https://www.ncbi.nlm.nih.gov/pubmed/24078912
http://dx.doi.org/10.1155/2013/292063
_version_ 1782477384560148480
author Mazandu, Gaston K.
Mulder, Nicola J.
author_facet Mazandu, Gaston K.
Mulder, Nicola J.
author_sort Mazandu, Gaston K.
collection PubMed
description Several approaches have been proposed for computing term information content (IC) and semantic similarity scores within the gene ontology (GO) directed acyclic graph (DAG). These approaches contributed to improving protein analyses at the functional level. Considering the recent proliferation of these approaches, a unified theory in a well-defined mathematical framework is necessary in order to provide a theoretical basis for validating these approaches. We review the existing IC-based ontological similarity approaches developed in the context of biomedical and bioinformatics fields to propose a general framework and unified description of all these measures. We have conducted an experimental evaluation to assess the impact of IC approaches, different normalization models, and correction factors on the performance of a functional similarity metric. Results reveal that considering only parents or only children of terms when assessing information content or semantic similarity scores negatively impacts the approach under consideration. This study produces a unified framework for current and future GO semantic similarity measures and provides theoretical basics for comparing different approaches. The experimental evaluation of different approaches based on different term information content models paves the way towards a solution to the issue of scoring a term's specificity in the GO DAG.
format Online
Article
Text
id pubmed-3775452
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-37754522013-09-29 Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory Mazandu, Gaston K. Mulder, Nicola J. Biomed Res Int Research Article Several approaches have been proposed for computing term information content (IC) and semantic similarity scores within the gene ontology (GO) directed acyclic graph (DAG). These approaches contributed to improving protein analyses at the functional level. Considering the recent proliferation of these approaches, a unified theory in a well-defined mathematical framework is necessary in order to provide a theoretical basis for validating these approaches. We review the existing IC-based ontological similarity approaches developed in the context of biomedical and bioinformatics fields to propose a general framework and unified description of all these measures. We have conducted an experimental evaluation to assess the impact of IC approaches, different normalization models, and correction factors on the performance of a functional similarity metric. Results reveal that considering only parents or only children of terms when assessing information content or semantic similarity scores negatively impacts the approach under consideration. This study produces a unified framework for current and future GO semantic similarity measures and provides theoretical basics for comparing different approaches. The experimental evaluation of different approaches based on different term information content models paves the way towards a solution to the issue of scoring a term's specificity in the GO DAG. Hindawi Publishing Corporation 2013 2013-09-02 /pmc/articles/PMC3775452/ /pubmed/24078912 http://dx.doi.org/10.1155/2013/292063 Text en Copyright © 2013 G. K. Mazandu and N. J. Mulder. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mazandu, Gaston K.
Mulder, Nicola J.
Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
title Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
title_full Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
title_fullStr Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
title_full_unstemmed Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
title_short Information Content-Based Gene Ontology Semantic Similarity Approaches: Toward a Unified Framework Theory
title_sort information content-based gene ontology semantic similarity approaches: toward a unified framework theory
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3775452/
https://www.ncbi.nlm.nih.gov/pubmed/24078912
http://dx.doi.org/10.1155/2013/292063
work_keys_str_mv AT mazandugastonk informationcontentbasedgeneontologysemanticsimilarityapproachestowardaunifiedframeworktheory
AT muldernicolaj informationcontentbasedgeneontologysemanticsimilarityapproachestowardaunifiedframeworktheory