Cargando…

Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics

Patient similarity analytics has emerged as an essential tool to identify cohorts of patients who have similar clinical characteristics to some specific patient of interest. In this study, we propose a patient similarity measure called D3K that incorporates domain knowledge and data-driven insights....

Descripción completa

Detalles Bibliográficos
Autores principales: Oei, Ronald Wihal, Fang, Hao Sen Andrew, Tan, Wei-Ying, Hsu, Wynne, Lee, Mong-Li, Tan, Ngiap-Chuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8398126/
https://www.ncbi.nlm.nih.gov/pubmed/34442343
http://dx.doi.org/10.3390/jpm11080699
_version_ 1783744763432796160
author Oei, Ronald Wihal
Fang, Hao Sen Andrew
Tan, Wei-Ying
Hsu, Wynne
Lee, Mong-Li
Tan, Ngiap-Chuan
author_facet Oei, Ronald Wihal
Fang, Hao Sen Andrew
Tan, Wei-Ying
Hsu, Wynne
Lee, Mong-Li
Tan, Ngiap-Chuan
author_sort Oei, Ronald Wihal
collection PubMed
description Patient similarity analytics has emerged as an essential tool to identify cohorts of patients who have similar clinical characteristics to some specific patient of interest. In this study, we propose a patient similarity measure called D3K that incorporates domain knowledge and data-driven insights. Using the electronic health records (EHRs) of 169,434 patients with either diabetes, hypertension or dyslipidaemia (DHL), we construct patient feature vectors containing demographics, vital signs, laboratory test results, and prescribed medications. We discretize the variables of interest into various bins based on domain knowledge and make the patient similarity computation to be aligned with clinical guidelines. Key findings from this study are: (1) D3K outperforms baseline approaches in all seven sub-cohorts; (2) our domain knowledge-based binning strategy outperformed the traditional percentile-based binning in all seven sub-cohorts; (3) there is substantial agreement between D3K and physicians (κ = 0.746), indicating that D3K can be applied to facilitate shared decision making. This is the first study to use patient similarity analytics on a cardiometabolic syndrome-related dataset sourced from medical institutions in Singapore. We consider patient similarity among patient cohorts with the same medical conditions to develop localized models for personalized decision support to improve the outcomes of a target patient.
format Online
Article
Text
id pubmed-8398126
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83981262021-08-29 Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics Oei, Ronald Wihal Fang, Hao Sen Andrew Tan, Wei-Ying Hsu, Wynne Lee, Mong-Li Tan, Ngiap-Chuan J Pers Med Article Patient similarity analytics has emerged as an essential tool to identify cohorts of patients who have similar clinical characteristics to some specific patient of interest. In this study, we propose a patient similarity measure called D3K that incorporates domain knowledge and data-driven insights. Using the electronic health records (EHRs) of 169,434 patients with either diabetes, hypertension or dyslipidaemia (DHL), we construct patient feature vectors containing demographics, vital signs, laboratory test results, and prescribed medications. We discretize the variables of interest into various bins based on domain knowledge and make the patient similarity computation to be aligned with clinical guidelines. Key findings from this study are: (1) D3K outperforms baseline approaches in all seven sub-cohorts; (2) our domain knowledge-based binning strategy outperformed the traditional percentile-based binning in all seven sub-cohorts; (3) there is substantial agreement between D3K and physicians (κ = 0.746), indicating that D3K can be applied to facilitate shared decision making. This is the first study to use patient similarity analytics on a cardiometabolic syndrome-related dataset sourced from medical institutions in Singapore. We consider patient similarity among patient cohorts with the same medical conditions to develop localized models for personalized decision support to improve the outcomes of a target patient. MDPI 2021-07-22 /pmc/articles/PMC8398126/ /pubmed/34442343 http://dx.doi.org/10.3390/jpm11080699 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Oei, Ronald Wihal
Fang, Hao Sen Andrew
Tan, Wei-Ying
Hsu, Wynne
Lee, Mong-Li
Tan, Ngiap-Chuan
Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics
title Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics
title_full Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics
title_fullStr Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics
title_full_unstemmed Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics
title_short Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics
title_sort using domain knowledge and data-driven insights for patient similarity analytics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8398126/
https://www.ncbi.nlm.nih.gov/pubmed/34442343
http://dx.doi.org/10.3390/jpm11080699
work_keys_str_mv AT oeironaldwihal usingdomainknowledgeanddatadriveninsightsforpatientsimilarityanalytics
AT fanghaosenandrew usingdomainknowledgeanddatadriveninsightsforpatientsimilarityanalytics
AT tanweiying usingdomainknowledgeanddatadriveninsightsforpatientsimilarityanalytics
AT hsuwynne usingdomainknowledgeanddatadriveninsightsforpatientsimilarityanalytics
AT leemongli usingdomainknowledgeanddatadriveninsightsforpatientsimilarityanalytics
AT tanngiapchuan usingdomainknowledgeanddatadriveninsightsforpatientsimilarityanalytics