Cargando…

SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association

BACKGROUND: Measuring similarity between diseases plays an important role in disease-related molecular function research. Functional associations between disease-related genes and semantic associations between diseases are often used to identify pairs of similar diseases from different perspectives....

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Liang, Li, Jie, Ju, Peng, Peng, Jiajie, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4059643/
https://www.ncbi.nlm.nih.gov/pubmed/24932637
http://dx.doi.org/10.1371/journal.pone.0099415
_version_ 1782321261422051328
author Cheng, Liang
Li, Jie
Ju, Peng
Peng, Jiajie
Wang, Yadong
author_facet Cheng, Liang
Li, Jie
Ju, Peng
Peng, Jiajie
Wang, Yadong
author_sort Cheng, Liang
collection PubMed
description BACKGROUND: Measuring similarity between diseases plays an important role in disease-related molecular function research. Functional associations between disease-related genes and semantic associations between diseases are often used to identify pairs of similar diseases from different perspectives. Currently, it is still a challenge to exploit both of them to calculate disease similarity. Therefore, a new method (SemFunSim) that integrates semantic and functional association is proposed to address the issue. METHODS: SemFunSim is designed as follows. First of all, FunSim (Functional similarity) is proposed to calculate disease similarity using disease-related gene sets in a weighted network of human gene function. Next, SemSim (Semantic Similarity) is devised to calculate disease similarity using the relationship between two diseases from Disease Ontology. Finally, FunSim and SemSim are integrated to measure disease similarity. RESULTS: The high average AUC (area under the receiver operating characteristic curve) (96.37%) shows that SemFunSim achieves a high true positive rate and a low false positive rate. 79 of the top 100 pairs of similar diseases identified by SemFunSim are annotated in the Comparative Toxicogenomics Database (CTD) as being targeted by the same therapeutic compounds, while other methods we compared could identify 35 or less such pairs among the top 100. Moreover, when using our method on diseases without annotated compounds in CTD, we could confirm many of our predicted candidate compounds from literature. This indicates that SemFunSim is an effective method for drug repositioning.
format Online
Article
Text
id pubmed-4059643
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40596432014-06-19 SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association Cheng, Liang Li, Jie Ju, Peng Peng, Jiajie Wang, Yadong PLoS One Research Article BACKGROUND: Measuring similarity between diseases plays an important role in disease-related molecular function research. Functional associations between disease-related genes and semantic associations between diseases are often used to identify pairs of similar diseases from different perspectives. Currently, it is still a challenge to exploit both of them to calculate disease similarity. Therefore, a new method (SemFunSim) that integrates semantic and functional association is proposed to address the issue. METHODS: SemFunSim is designed as follows. First of all, FunSim (Functional similarity) is proposed to calculate disease similarity using disease-related gene sets in a weighted network of human gene function. Next, SemSim (Semantic Similarity) is devised to calculate disease similarity using the relationship between two diseases from Disease Ontology. Finally, FunSim and SemSim are integrated to measure disease similarity. RESULTS: The high average AUC (area under the receiver operating characteristic curve) (96.37%) shows that SemFunSim achieves a high true positive rate and a low false positive rate. 79 of the top 100 pairs of similar diseases identified by SemFunSim are annotated in the Comparative Toxicogenomics Database (CTD) as being targeted by the same therapeutic compounds, while other methods we compared could identify 35 or less such pairs among the top 100. Moreover, when using our method on diseases without annotated compounds in CTD, we could confirm many of our predicted candidate compounds from literature. This indicates that SemFunSim is an effective method for drug repositioning. Public Library of Science 2014-06-16 /pmc/articles/PMC4059643/ /pubmed/24932637 http://dx.doi.org/10.1371/journal.pone.0099415 Text en © 2014 Cheng et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cheng, Liang
Li, Jie
Ju, Peng
Peng, Jiajie
Wang, Yadong
SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association
title SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association
title_full SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association
title_fullStr SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association
title_full_unstemmed SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association
title_short SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association
title_sort semfunsim: a new method for measuring disease similarity by integrating semantic and gene functional association
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4059643/
https://www.ncbi.nlm.nih.gov/pubmed/24932637
http://dx.doi.org/10.1371/journal.pone.0099415
work_keys_str_mv AT chengliang semfunsimanewmethodformeasuringdiseasesimilaritybyintegratingsemanticandgenefunctionalassociation
AT lijie semfunsimanewmethodformeasuringdiseasesimilaritybyintegratingsemanticandgenefunctionalassociation
AT jupeng semfunsimanewmethodformeasuringdiseasesimilaritybyintegratingsemanticandgenefunctionalassociation
AT pengjiajie semfunsimanewmethodformeasuringdiseasesimilaritybyintegratingsemanticandgenefunctionalassociation
AT wangyadong semfunsimanewmethodformeasuringdiseasesimilaritybyintegratingsemanticandgenefunctionalassociation