Cargando…

A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology

BACKGROUND: Recently, with the foundation and development of gene ontology (GO) resources, numerous works have been proposed to compute functional similarity of genes and achieved series of successes in some research fields. Focusing on the calculation of the information content (IC) of terms is the...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Zhen, Fang, Haichuan, Ye, Yangdong, Zhu, Zhenfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772239/
https://www.ncbi.nlm.nih.gov/pubmed/35057740
http://dx.doi.org/10.1186/s12859-022-04557-6
_version_ 1784635803347976192
author Tian, Zhen
Fang, Haichuan
Ye, Yangdong
Zhu, Zhenfeng
author_facet Tian, Zhen
Fang, Haichuan
Ye, Yangdong
Zhu, Zhenfeng
author_sort Tian, Zhen
collection PubMed
description BACKGROUND: Recently, with the foundation and development of gene ontology (GO) resources, numerous works have been proposed to compute functional similarity of genes and achieved series of successes in some research fields. Focusing on the calculation of the information content (IC) of terms is the main idea of these methods, which is essential for measuring functional similarity of genes. However, most approaches have some deficiencies, especially when measuring the IC of both GO terms and their corresponding annotated term sets. To this end, measuring functional similarity of genes accurately is still challenging. RESULTS: In this article, we proposed a novel gene functional similarity calculation method, which especially encapsulates the specificity of terms and edges (STE). The proposed method mainly contains three steps. Firstly, a novel computing model is put forward to compute the IC of terms. This model has the ability to exploit the specific structural information of GO terms. Secondly, the IC of term sets are computed by capturing the genetic structure between the terms contained in the set. Lastly, we measure the gene functional similarity according to the IC overlap ratio of the corresponding annotated genes sets. The proposed method accurately measures the IC of not only GO terms but also the annotated term sets by leveraging the specificity of edges in the GO graph. CONCLUSIONS: We conduct experiments on gene functional classification in biological pathways, gene expression datasets, and protein-protein interaction datasets. Extensive experimental results show the better performances of our proposed STE against several baseline methods.
format Online
Article
Text
id pubmed-8772239
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-87722392022-01-21 A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology Tian, Zhen Fang, Haichuan Ye, Yangdong Zhu, Zhenfeng BMC Bioinformatics Research BACKGROUND: Recently, with the foundation and development of gene ontology (GO) resources, numerous works have been proposed to compute functional similarity of genes and achieved series of successes in some research fields. Focusing on the calculation of the information content (IC) of terms is the main idea of these methods, which is essential for measuring functional similarity of genes. However, most approaches have some deficiencies, especially when measuring the IC of both GO terms and their corresponding annotated term sets. To this end, measuring functional similarity of genes accurately is still challenging. RESULTS: In this article, we proposed a novel gene functional similarity calculation method, which especially encapsulates the specificity of terms and edges (STE). The proposed method mainly contains three steps. Firstly, a novel computing model is put forward to compute the IC of terms. This model has the ability to exploit the specific structural information of GO terms. Secondly, the IC of term sets are computed by capturing the genetic structure between the terms contained in the set. Lastly, we measure the gene functional similarity according to the IC overlap ratio of the corresponding annotated genes sets. The proposed method accurately measures the IC of not only GO terms but also the annotated term sets by leveraging the specificity of edges in the GO graph. CONCLUSIONS: We conduct experiments on gene functional classification in biological pathways, gene expression datasets, and protein-protein interaction datasets. Extensive experimental results show the better performances of our proposed STE against several baseline methods. BioMed Central 2022-01-20 /pmc/articles/PMC8772239/ /pubmed/35057740 http://dx.doi.org/10.1186/s12859-022-04557-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Tian, Zhen
Fang, Haichuan
Ye, Yangdong
Zhu, Zhenfeng
A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
title A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
title_full A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
title_fullStr A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
title_full_unstemmed A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
title_short A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
title_sort novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772239/
https://www.ncbi.nlm.nih.gov/pubmed/35057740
http://dx.doi.org/10.1186/s12859-022-04557-6
work_keys_str_mv AT tianzhen anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT fanghaichuan anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT yeyangdong anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT zhuzhenfeng anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT tianzhen novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT fanghaichuan novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT yeyangdong novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology
AT zhuzhenfeng novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology