Cargando…

Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses

BACKGROUND: Semantic similarity analysis facilitates automated semantic explanations of biological and clinical data annotated by biomedical ontologies. Gene ontology (GO) has become one of the most important biomedical ontologies with a set of controlled vocabularies, providing rich semantic annota...

Descripción completa

Detalles Bibliográficos
Autores principales: Bien, Sang Jay, Park, Chan Hee, Shim, Hae Jin, Yang, Woongcheol, Kim, Jihun, Kim, Ju Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Group 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3422825/
https://www.ncbi.nlm.nih.gov/pubmed/22374934
http://dx.doi.org/10.1136/amiajnl-2011-000659
_version_ 1782241061027971072
author Bien, Sang Jay
Park, Chan Hee
Shim, Hae Jin
Yang, Woongcheol
Kim, Jihun
Kim, Ju Han
author_facet Bien, Sang Jay
Park, Chan Hee
Shim, Hae Jin
Yang, Woongcheol
Kim, Jihun
Kim, Ju Han
author_sort Bien, Sang Jay
collection PubMed
description BACKGROUND: Semantic similarity analysis facilitates automated semantic explanations of biological and clinical data annotated by biomedical ontologies. Gene ontology (GO) has become one of the most important biomedical ontologies with a set of controlled vocabularies, providing rich semantic annotations for genes and molecular phenotypes for diseases. Current methods for measuring GO semantic similarities are limited to considering only the ancestor terms while neglecting the descendants. One can find many GO term pairs whose ancestors are identical but whose descendants are very different and vice versa. Moreover, the lower parts of GO trees are full of terms with more specific semantics. METHODS: This study proposed a method of measuring semantic similarities between GO terms using the entire GO tree structure, including both the upper (ancestral) and the lower (descendant) parts. Comprehensive comparison studies were performed with well-known information content-based and graph structure-based semantic similarity measures with protein sequence similarities, gene expression-profile correlations, protein–protein interactions, and biological pathway analyses. CONCLUSION: The proposed bidirectional measure of semantic similarity outperformed other graph-based and information content-based methods.
format Online
Article
Text
id pubmed-3422825
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BMJ Group
record_format MEDLINE/PubMed
spelling pubmed-34228252012-08-20 Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses Bien, Sang Jay Park, Chan Hee Shim, Hae Jin Yang, Woongcheol Kim, Jihun Kim, Ju Han J Am Med Inform Assoc Research and Applications BACKGROUND: Semantic similarity analysis facilitates automated semantic explanations of biological and clinical data annotated by biomedical ontologies. Gene ontology (GO) has become one of the most important biomedical ontologies with a set of controlled vocabularies, providing rich semantic annotations for genes and molecular phenotypes for diseases. Current methods for measuring GO semantic similarities are limited to considering only the ancestor terms while neglecting the descendants. One can find many GO term pairs whose ancestors are identical but whose descendants are very different and vice versa. Moreover, the lower parts of GO trees are full of terms with more specific semantics. METHODS: This study proposed a method of measuring semantic similarities between GO terms using the entire GO tree structure, including both the upper (ancestral) and the lower (descendant) parts. Comprehensive comparison studies were performed with well-known information content-based and graph structure-based semantic similarity measures with protein sequence similarities, gene expression-profile correlations, protein–protein interactions, and biological pathway analyses. CONCLUSION: The proposed bidirectional measure of semantic similarity outperformed other graph-based and information content-based methods. BMJ Group 2012-02-28 2012 /pmc/articles/PMC3422825/ /pubmed/22374934 http://dx.doi.org/10.1136/amiajnl-2011-000659 Text en © 2012, Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions. This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.
spellingShingle Research and Applications
Bien, Sang Jay
Park, Chan Hee
Shim, Hae Jin
Yang, Woongcheol
Kim, Jihun
Kim, Ju Han
Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
title Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
title_full Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
title_fullStr Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
title_full_unstemmed Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
title_short Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
title_sort bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3422825/
https://www.ncbi.nlm.nih.gov/pubmed/22374934
http://dx.doi.org/10.1136/amiajnl-2011-000659
work_keys_str_mv AT biensangjay bidirectionalsemanticsimilarityforgeneontologytooptimizebiologicalandclinicalanalyses
AT parkchanhee bidirectionalsemanticsimilarityforgeneontologytooptimizebiologicalandclinicalanalyses
AT shimhaejin bidirectionalsemanticsimilarityforgeneontologytooptimizebiologicalandclinicalanalyses
AT yangwoongcheol bidirectionalsemanticsimilarityforgeneontologytooptimizebiologicalandclinicalanalyses
AT kimjihun bidirectionalsemanticsimilarityforgeneontologytooptimizebiologicalandclinicalanalyses
AT kimjuhan bidirectionalsemanticsimilarityforgeneontologytooptimizebiologicalandclinicalanalyses