Cargando…

GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms

Measuring the semantic similarity between Gene Ontology (GO) terms is an essential step in functional bioinformatics research. We implemented a software named GOGO for calculating the semantic similarity between GO terms. GOGO has the advantages of both information-content-based and hybrid methods,...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Chenguang, Wang, Zheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6180005/
https://www.ncbi.nlm.nih.gov/pubmed/30305653
http://dx.doi.org/10.1038/s41598-018-33219-y
_version_ 1783362111848579072
author Zhao, Chenguang
Wang, Zheng
author_facet Zhao, Chenguang
Wang, Zheng
author_sort Zhao, Chenguang
collection PubMed
description Measuring the semantic similarity between Gene Ontology (GO) terms is an essential step in functional bioinformatics research. We implemented a software named GOGO for calculating the semantic similarity between GO terms. GOGO has the advantages of both information-content-based and hybrid methods, such as Resnik’s and Wang’s methods. Moreover, GOGO is relatively fast and does not need to calculate information content (IC) from a large gene annotation corpus but still has the advantage of using IC. This is achieved by considering the number of children nodes in the GO directed acyclic graphs when calculating the semantic contribution of an ancestor node giving to its descendent nodes. GOGO can calculate functional similarities between genes and then cluster genes based on their functional similarities. Evaluations performed on multiple pathways retrieved from the saccharomyces genome database (SGD) show that GOGO can accurately and robustly cluster genes based on functional similarities. We release GOGO as a web server and also as a stand-alone tool, which allows convenient execution of the tool for a small number of GO terms or integration of the tool into bioinformatics pipelines for large-scale calculations. GOGO can be freely accessed or downloaded from http://dna.cs.miami.edu/GOGO/.
format Online
Article
Text
id pubmed-6180005
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-61800052018-10-15 GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms Zhao, Chenguang Wang, Zheng Sci Rep Article Measuring the semantic similarity between Gene Ontology (GO) terms is an essential step in functional bioinformatics research. We implemented a software named GOGO for calculating the semantic similarity between GO terms. GOGO has the advantages of both information-content-based and hybrid methods, such as Resnik’s and Wang’s methods. Moreover, GOGO is relatively fast and does not need to calculate information content (IC) from a large gene annotation corpus but still has the advantage of using IC. This is achieved by considering the number of children nodes in the GO directed acyclic graphs when calculating the semantic contribution of an ancestor node giving to its descendent nodes. GOGO can calculate functional similarities between genes and then cluster genes based on their functional similarities. Evaluations performed on multiple pathways retrieved from the saccharomyces genome database (SGD) show that GOGO can accurately and robustly cluster genes based on functional similarities. We release GOGO as a web server and also as a stand-alone tool, which allows convenient execution of the tool for a small number of GO terms or integration of the tool into bioinformatics pipelines for large-scale calculations. GOGO can be freely accessed or downloaded from http://dna.cs.miami.edu/GOGO/. Nature Publishing Group UK 2018-10-10 /pmc/articles/PMC6180005/ /pubmed/30305653 http://dx.doi.org/10.1038/s41598-018-33219-y Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zhao, Chenguang
Wang, Zheng
GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
title GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
title_full GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
title_fullStr GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
title_full_unstemmed GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
title_short GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms
title_sort gogo: an improved algorithm to measure the semantic similarity between gene ontology terms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6180005/
https://www.ncbi.nlm.nih.gov/pubmed/30305653
http://dx.doi.org/10.1038/s41598-018-33219-y
work_keys_str_mv AT zhaochenguang gogoanimprovedalgorithmtomeasurethesemanticsimilaritybetweengeneontologyterms
AT wangzheng gogoanimprovedalgorithmtomeasurethesemanticsimilaritybetweengeneontologyterms