Cargando…
A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology
BACKGROUND: Recently, with the foundation and development of gene ontology (GO) resources, numerous works have been proposed to compute functional similarity of genes and achieved series of successes in some research fields. Focusing on the calculation of the information content (IC) of terms is the...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772239/ https://www.ncbi.nlm.nih.gov/pubmed/35057740 http://dx.doi.org/10.1186/s12859-022-04557-6 |
_version_ | 1784635803347976192 |
---|---|
author | Tian, Zhen Fang, Haichuan Ye, Yangdong Zhu, Zhenfeng |
author_facet | Tian, Zhen Fang, Haichuan Ye, Yangdong Zhu, Zhenfeng |
author_sort | Tian, Zhen |
collection | PubMed |
description | BACKGROUND: Recently, with the foundation and development of gene ontology (GO) resources, numerous works have been proposed to compute functional similarity of genes and achieved series of successes in some research fields. Focusing on the calculation of the information content (IC) of terms is the main idea of these methods, which is essential for measuring functional similarity of genes. However, most approaches have some deficiencies, especially when measuring the IC of both GO terms and their corresponding annotated term sets. To this end, measuring functional similarity of genes accurately is still challenging. RESULTS: In this article, we proposed a novel gene functional similarity calculation method, which especially encapsulates the specificity of terms and edges (STE). The proposed method mainly contains three steps. Firstly, a novel computing model is put forward to compute the IC of terms. This model has the ability to exploit the specific structural information of GO terms. Secondly, the IC of term sets are computed by capturing the genetic structure between the terms contained in the set. Lastly, we measure the gene functional similarity according to the IC overlap ratio of the corresponding annotated genes sets. The proposed method accurately measures the IC of not only GO terms but also the annotated term sets by leveraging the specificity of edges in the GO graph. CONCLUSIONS: We conduct experiments on gene functional classification in biological pathways, gene expression datasets, and protein-protein interaction datasets. Extensive experimental results show the better performances of our proposed STE against several baseline methods. |
format | Online Article Text |
id | pubmed-8772239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-87722392022-01-21 A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology Tian, Zhen Fang, Haichuan Ye, Yangdong Zhu, Zhenfeng BMC Bioinformatics Research BACKGROUND: Recently, with the foundation and development of gene ontology (GO) resources, numerous works have been proposed to compute functional similarity of genes and achieved series of successes in some research fields. Focusing on the calculation of the information content (IC) of terms is the main idea of these methods, which is essential for measuring functional similarity of genes. However, most approaches have some deficiencies, especially when measuring the IC of both GO terms and their corresponding annotated term sets. To this end, measuring functional similarity of genes accurately is still challenging. RESULTS: In this article, we proposed a novel gene functional similarity calculation method, which especially encapsulates the specificity of terms and edges (STE). The proposed method mainly contains three steps. Firstly, a novel computing model is put forward to compute the IC of terms. This model has the ability to exploit the specific structural information of GO terms. Secondly, the IC of term sets are computed by capturing the genetic structure between the terms contained in the set. Lastly, we measure the gene functional similarity according to the IC overlap ratio of the corresponding annotated genes sets. The proposed method accurately measures the IC of not only GO terms but also the annotated term sets by leveraging the specificity of edges in the GO graph. CONCLUSIONS: We conduct experiments on gene functional classification in biological pathways, gene expression datasets, and protein-protein interaction datasets. Extensive experimental results show the better performances of our proposed STE against several baseline methods. BioMed Central 2022-01-20 /pmc/articles/PMC8772239/ /pubmed/35057740 http://dx.doi.org/10.1186/s12859-022-04557-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Tian, Zhen Fang, Haichuan Ye, Yangdong Zhu, Zhenfeng A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
title | A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
title_full | A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
title_fullStr | A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
title_full_unstemmed | A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
title_short | A novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
title_sort | novel gene functional similarity calculation model by utilizing the specificity of terms and relationships in gene ontology |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772239/ https://www.ncbi.nlm.nih.gov/pubmed/35057740 http://dx.doi.org/10.1186/s12859-022-04557-6 |
work_keys_str_mv | AT tianzhen anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT fanghaichuan anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT yeyangdong anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT zhuzhenfeng anovelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT tianzhen novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT fanghaichuan novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT yeyangdong novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology AT zhuzhenfeng novelgenefunctionalsimilaritycalculationmodelbyutilizingthespecificityoftermsandrelationshipsingeneontology |