Cargando…

Measuring semantic similarities by combining gene ontology annotations and gene co-function networks

BACKGROUND: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Jiajie, Uygun, Sahra, Kim, Taehyong, Wang, Yadong, Rhee, Seung Y, Chen, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4339680/
https://www.ncbi.nlm.nih.gov/pubmed/25886899
http://dx.doi.org/10.1186/s12859-015-0474-7
_version_ 1782358898355732480
author Peng, Jiajie
Uygun, Sahra
Kim, Taehyong
Wang, Yadong
Rhee, Seung Y
Chen, Jin
author_facet Peng, Jiajie
Uygun, Sahra
Kim, Taehyong
Wang, Yadong
Rhee, Seung Y
Chen, Jin
author_sort Peng, Jiajie
collection PubMed
description BACKGROUND: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. RESULTS: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstrate that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. CONCLUSIONS: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited. Supplementary information and software are available at http://www.msu.edu/~jinchen/NETSIM. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0474-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4339680
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43396802015-02-26 Measuring semantic similarities by combining gene ontology annotations and gene co-function networks Peng, Jiajie Uygun, Sahra Kim, Taehyong Wang, Yadong Rhee, Seung Y Chen, Jin BMC Bioinformatics Methodology Article BACKGROUND: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. RESULTS: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstrate that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. CONCLUSIONS: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited. Supplementary information and software are available at http://www.msu.edu/~jinchen/NETSIM. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0474-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-14 /pmc/articles/PMC4339680/ /pubmed/25886899 http://dx.doi.org/10.1186/s12859-015-0474-7 Text en © Peng et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Peng, Jiajie
Uygun, Sahra
Kim, Taehyong
Wang, Yadong
Rhee, Seung Y
Chen, Jin
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
title Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
title_full Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
title_fullStr Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
title_full_unstemmed Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
title_short Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
title_sort measuring semantic similarities by combining gene ontology annotations and gene co-function networks
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4339680/
https://www.ncbi.nlm.nih.gov/pubmed/25886899
http://dx.doi.org/10.1186/s12859-015-0474-7
work_keys_str_mv AT pengjiajie measuringsemanticsimilaritiesbycombininggeneontologyannotationsandgenecofunctionnetworks
AT uygunsahra measuringsemanticsimilaritiesbycombininggeneontologyannotationsandgenecofunctionnetworks
AT kimtaehyong measuringsemanticsimilaritiesbycombininggeneontologyannotationsandgenecofunctionnetworks
AT wangyadong measuringsemanticsimilaritiesbycombininggeneontologyannotationsandgenecofunctionnetworks
AT rheeseungy measuringsemanticsimilaritiesbycombininggeneontologyannotationsandgenecofunctionnetworks
AT chenjin measuringsemanticsimilaritiesbycombininggeneontologyannotationsandgenecofunctionnetworks