Cargando…

An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations

Motivation: In order to create controlled vocabularies for shared use in different biomedical domains, a large number of biomedical ontologies such as Disease Ontology (DO) and Human Phenotype Ontology (HPO), etc., are created in the bioinformatics community. Quantitative measures of the association...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Shuhui, Zhang, Lei, Liu, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6536643/
https://www.ncbi.nlm.nih.gov/pubmed/31164903
http://dx.doi.org/10.3389/fgene.2019.00466
_version_ 1783421801684008960
author Su, Shuhui
Zhang, Lei
Liu, Jian
author_facet Su, Shuhui
Zhang, Lei
Liu, Jian
author_sort Su, Shuhui
collection PubMed
description Motivation: In order to create controlled vocabularies for shared use in different biomedical domains, a large number of biomedical ontologies such as Disease Ontology (DO) and Human Phenotype Ontology (HPO), etc., are created in the bioinformatics community. Quantitative measures of the associations among diseases could help researchers gain a deep insight of human diseases, since similar diseases are usually caused by similar molecular origins or have similar phenotypes, which is beneficial to reveal the common attributes of diseases and improve the corresponding diagnoses and treatment plans. Some previous are proposed to measure the disease similarity using a particular biomedical ontology during the past few years, but for a newly discovered disease or a disease with few related genetic information in Disease Ontology (i.e., a disease with less disease-gene associations), these previous approaches usually ignores the joint computation of disease similarity by integrating gene and phenotype associations. Results: In this paper we propose a novel method called GPSim to effectively deduce the semantic similarity of diseases. In particular, GPSim calculates the similarity by jointly utilizing gene, disease and phenotype associations extracted from multiple biomedical ontologies and databases. We also explore the phenotypic factors such as the depth of HPO terms and the number of phenotypic associations that affect the evaluation performance. A final experimental evaluation is carried out to evaluate the performance of GPSim and shows its advantages over previous approaches.
format Online
Article
Text
id pubmed-6536643
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65366432019-06-04 An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations Su, Shuhui Zhang, Lei Liu, Jian Front Genet Genetics Motivation: In order to create controlled vocabularies for shared use in different biomedical domains, a large number of biomedical ontologies such as Disease Ontology (DO) and Human Phenotype Ontology (HPO), etc., are created in the bioinformatics community. Quantitative measures of the associations among diseases could help researchers gain a deep insight of human diseases, since similar diseases are usually caused by similar molecular origins or have similar phenotypes, which is beneficial to reveal the common attributes of diseases and improve the corresponding diagnoses and treatment plans. Some previous are proposed to measure the disease similarity using a particular biomedical ontology during the past few years, but for a newly discovered disease or a disease with few related genetic information in Disease Ontology (i.e., a disease with less disease-gene associations), these previous approaches usually ignores the joint computation of disease similarity by integrating gene and phenotype associations. Results: In this paper we propose a novel method called GPSim to effectively deduce the semantic similarity of diseases. In particular, GPSim calculates the similarity by jointly utilizing gene, disease and phenotype associations extracted from multiple biomedical ontologies and databases. We also explore the phenotypic factors such as the depth of HPO terms and the number of phenotypic associations that affect the evaluation performance. A final experimental evaluation is carried out to evaluate the performance of GPSim and shows its advantages over previous approaches. Frontiers Media S.A. 2019-05-21 /pmc/articles/PMC6536643/ /pubmed/31164903 http://dx.doi.org/10.3389/fgene.2019.00466 Text en Copyright © 2019 Su, Zhang and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Su, Shuhui
Zhang, Lei
Liu, Jian
An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations
title An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations
title_full An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations
title_fullStr An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations
title_full_unstemmed An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations
title_short An Effective Method to Measure Disease Similarity Using Gene and Phenotype Associations
title_sort effective method to measure disease similarity using gene and phenotype associations
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6536643/
https://www.ncbi.nlm.nih.gov/pubmed/31164903
http://dx.doi.org/10.3389/fgene.2019.00466
work_keys_str_mv AT sushuhui aneffectivemethodtomeasurediseasesimilarityusinggeneandphenotypeassociations
AT zhanglei aneffectivemethodtomeasurediseasesimilarityusinggeneandphenotypeassociations
AT liujian aneffectivemethodtomeasurediseasesimilarityusinggeneandphenotypeassociations
AT sushuhui effectivemethodtomeasurediseasesimilarityusinggeneandphenotypeassociations
AT zhanglei effectivemethodtomeasurediseasesimilarityusinggeneandphenotypeassociations
AT liujian effectivemethodtomeasurediseasesimilarityusinggeneandphenotypeassociations