Cargando…

Rare disease knowledge enrichment through a data-driven approach

BACKGROUND: Existing resources to assist the diagnosis of rare diseases are usually curated from the literature that can be limited for clinical use. It often takes substantial effort before the suspicion of a rare disease is even raised to utilize those resources. The primary goal of this study was...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Feichen, Zhao, Yiqing, Wang, Liwei, Mojarad, Majid Rastegar, Wang, Yanshan, Liu, Sijia, Liu, Hongfang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6376651/
https://www.ncbi.nlm.nih.gov/pubmed/30764825
http://dx.doi.org/10.1186/s12911-019-0752-9
_version_ 1783395598552006656
author Shen, Feichen
Zhao, Yiqing
Wang, Liwei
Mojarad, Majid Rastegar
Wang, Yanshan
Liu, Sijia
Liu, Hongfang
author_facet Shen, Feichen
Zhao, Yiqing
Wang, Liwei
Mojarad, Majid Rastegar
Wang, Yanshan
Liu, Sijia
Liu, Hongfang
author_sort Shen, Feichen
collection PubMed
description BACKGROUND: Existing resources to assist the diagnosis of rare diseases are usually curated from the literature that can be limited for clinical use. It often takes substantial effort before the suspicion of a rare disease is even raised to utilize those resources. The primary goal of this study was to apply a data-driven approach to enrich existing rare disease resources by mining phenotype-disease associations from electronic medical record (EMR). METHODS: We first applied association rule mining algorithms on EMR to extract significant phenotype-disease associations and enriched existing rare disease resources (Human Phenotype Ontology and Orphanet (HPO-Orphanet)). We generated phenotype-disease bipartite graphs for HPO-Orphanet, EMR, and enriched knowledge base HPO-Orphanet + and conducted a case study on Hodgkin lymphoma to compare performance on differential diagnosis among these three graphs. RESULTS: We used disease-disease similarity generated by the eRAM, an existing rare disease encyclopedia, as a gold standard to compare the three graphs with sensitivity and specificity as (0.17, 0.36, 0.46) and (0.52, 0.47, 0.51) for three graphs respectively. We also compared the top 15 diseases generated by the HPO-Orphanet + graph with eRAM and another clinical diagnostic tool, the Phenomizer. CONCLUSIONS: Per our evaluation results, our approach was able to enrich existing rare disease knowledge resources with phenotype-disease associations from EMR and thus support rare disease differential diagnosis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-019-0752-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6376651
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63766512019-02-27 Rare disease knowledge enrichment through a data-driven approach Shen, Feichen Zhao, Yiqing Wang, Liwei Mojarad, Majid Rastegar Wang, Yanshan Liu, Sijia Liu, Hongfang BMC Med Inform Decis Mak Research Article BACKGROUND: Existing resources to assist the diagnosis of rare diseases are usually curated from the literature that can be limited for clinical use. It often takes substantial effort before the suspicion of a rare disease is even raised to utilize those resources. The primary goal of this study was to apply a data-driven approach to enrich existing rare disease resources by mining phenotype-disease associations from electronic medical record (EMR). METHODS: We first applied association rule mining algorithms on EMR to extract significant phenotype-disease associations and enriched existing rare disease resources (Human Phenotype Ontology and Orphanet (HPO-Orphanet)). We generated phenotype-disease bipartite graphs for HPO-Orphanet, EMR, and enriched knowledge base HPO-Orphanet + and conducted a case study on Hodgkin lymphoma to compare performance on differential diagnosis among these three graphs. RESULTS: We used disease-disease similarity generated by the eRAM, an existing rare disease encyclopedia, as a gold standard to compare the three graphs with sensitivity and specificity as (0.17, 0.36, 0.46) and (0.52, 0.47, 0.51) for three graphs respectively. We also compared the top 15 diseases generated by the HPO-Orphanet + graph with eRAM and another clinical diagnostic tool, the Phenomizer. CONCLUSIONS: Per our evaluation results, our approach was able to enrich existing rare disease knowledge resources with phenotype-disease associations from EMR and thus support rare disease differential diagnosis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-019-0752-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-02-14 /pmc/articles/PMC6376651/ /pubmed/30764825 http://dx.doi.org/10.1186/s12911-019-0752-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Shen, Feichen
Zhao, Yiqing
Wang, Liwei
Mojarad, Majid Rastegar
Wang, Yanshan
Liu, Sijia
Liu, Hongfang
Rare disease knowledge enrichment through a data-driven approach
title Rare disease knowledge enrichment through a data-driven approach
title_full Rare disease knowledge enrichment through a data-driven approach
title_fullStr Rare disease knowledge enrichment through a data-driven approach
title_full_unstemmed Rare disease knowledge enrichment through a data-driven approach
title_short Rare disease knowledge enrichment through a data-driven approach
title_sort rare disease knowledge enrichment through a data-driven approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6376651/
https://www.ncbi.nlm.nih.gov/pubmed/30764825
http://dx.doi.org/10.1186/s12911-019-0752-9
work_keys_str_mv AT shenfeichen rarediseaseknowledgeenrichmentthroughadatadrivenapproach
AT zhaoyiqing rarediseaseknowledgeenrichmentthroughadatadrivenapproach
AT wangliwei rarediseaseknowledgeenrichmentthroughadatadrivenapproach
AT mojaradmajidrastegar rarediseaseknowledgeenrichmentthroughadatadrivenapproach
AT wangyanshan rarediseaseknowledgeenrichmentthroughadatadrivenapproach
AT liusijia rarediseaseknowledgeenrichmentthroughadatadrivenapproach
AT liuhongfang rarediseaseknowledgeenrichmentthroughadatadrivenapproach