Cargando…

A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology

BACKGROUND: Although rapid developed sequencing technologies make it possible for genotype data to be used in clinical diagnosis, it is still challenging for clinicians to understand the results of sequencing and make correct judgement based on them. Before this, diagnosis based on clinical features...

Descripción completa

Detalles Bibliográficos
Autores principales: Gong, Xiaofeng, Jiang, Jianping, Duan, Zhongqu, Lu, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998886/
https://www.ncbi.nlm.nih.gov/pubmed/29745853
http://dx.doi.org/10.1186/s12859-018-2064-y
_version_ 1783331322855424000
author Gong, Xiaofeng
Jiang, Jianping
Duan, Zhongqu
Lu, Hui
author_facet Gong, Xiaofeng
Jiang, Jianping
Duan, Zhongqu
Lu, Hui
author_sort Gong, Xiaofeng
collection PubMed
description BACKGROUND: Although rapid developed sequencing technologies make it possible for genotype data to be used in clinical diagnosis, it is still challenging for clinicians to understand the results of sequencing and make correct judgement based on them. Before this, diagnosis based on clinical features held a leading position. With the establishment of the Human Phenotype Ontology (HPO) and the enrichment of phenotype-disease annotations, there throws much more attention to the improvement of phenotype-based diagnosis. RESULTS: In this study, we presented a novel method called RelativeBestPair to measure similarity from the query terms to hereditary diseases based on HPO and then rank the candidate diseases. To evaluate the performance, we simulated a set of patients based on 44 complex diseases. Besides, by adding noise or imprecision or both, cases closer to real clinical conditions were generated. Thus, four simulated datasets were used to make comparison among RelativeBestPair and seven existing semantic similarity measures. RelativeBestPair ranked the underlying disease as top 1 on 93.73% of the simulated dataset without noise and imprecision, 93.64% of the simulated dataset with noise and without imprecision, 39.82% of the simulated dataset without noise and with imprecision, and 33.64% of the simulated dataset with both noise and imprecision. CONCLUSION: Compared with the seven existing semantic similarity measures, RelativeBestPair showed similar performance in two datasets without imprecision. While RelativeBestPair appeared to be equal to Resnik and better than other six methods in the simulated dataset without noise and with imprecision, it significantly outperformed all other seven methods in the simulated dataset with both noise and imprecision. It can be indicated that RelativeBestPair might be of great help in clinical setting.
format Online
Article
Text
id pubmed-5998886
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59988862018-06-25 A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology Gong, Xiaofeng Jiang, Jianping Duan, Zhongqu Lu, Hui BMC Bioinformatics Research BACKGROUND: Although rapid developed sequencing technologies make it possible for genotype data to be used in clinical diagnosis, it is still challenging for clinicians to understand the results of sequencing and make correct judgement based on them. Before this, diagnosis based on clinical features held a leading position. With the establishment of the Human Phenotype Ontology (HPO) and the enrichment of phenotype-disease annotations, there throws much more attention to the improvement of phenotype-based diagnosis. RESULTS: In this study, we presented a novel method called RelativeBestPair to measure similarity from the query terms to hereditary diseases based on HPO and then rank the candidate diseases. To evaluate the performance, we simulated a set of patients based on 44 complex diseases. Besides, by adding noise or imprecision or both, cases closer to real clinical conditions were generated. Thus, four simulated datasets were used to make comparison among RelativeBestPair and seven existing semantic similarity measures. RelativeBestPair ranked the underlying disease as top 1 on 93.73% of the simulated dataset without noise and imprecision, 93.64% of the simulated dataset with noise and without imprecision, 39.82% of the simulated dataset without noise and with imprecision, and 33.64% of the simulated dataset with both noise and imprecision. CONCLUSION: Compared with the seven existing semantic similarity measures, RelativeBestPair showed similar performance in two datasets without imprecision. While RelativeBestPair appeared to be equal to Resnik and better than other six methods in the simulated dataset without noise and with imprecision, it significantly outperformed all other seven methods in the simulated dataset with both noise and imprecision. It can be indicated that RelativeBestPair might be of great help in clinical setting. BioMed Central 2018-05-08 /pmc/articles/PMC5998886/ /pubmed/29745853 http://dx.doi.org/10.1186/s12859-018-2064-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Gong, Xiaofeng
Jiang, Jianping
Duan, Zhongqu
Lu, Hui
A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
title A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
title_full A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
title_fullStr A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
title_full_unstemmed A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
title_short A new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
title_sort new method to measure the semantic similarity from query phenotypic abnormalities to diseases based on the human phenotype ontology
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998886/
https://www.ncbi.nlm.nih.gov/pubmed/29745853
http://dx.doi.org/10.1186/s12859-018-2064-y
work_keys_str_mv AT gongxiaofeng anewmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT jiangjianping anewmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT duanzhongqu anewmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT luhui anewmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT gongxiaofeng newmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT jiangjianping newmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT duanzhongqu newmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology
AT luhui newmethodtomeasurethesemanticsimilarityfromqueryphenotypicabnormalitiestodiseasesbasedonthehumanphenotypeontology