Cargando…

Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development

BACKGROUND: Sequential information in electronic medical records is valuable and helpful for patient outcome prediction but is rarely used for patient similarity measurement because of its unevenness, irregularity, and heterogeneity. OBJECTIVE: We aimed to develop a patient similarity framework for...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ni, Wang, Muyu, Zhou, Yang, Liu, Honglei, Wei, Lan, Fei, Xiaolu, Chen, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8778569/
https://www.ncbi.nlm.nih.gov/pubmed/34989682
http://dx.doi.org/10.2196/30720
_version_ 1784637357786398720
author Wang, Ni
Wang, Muyu
Zhou, Yang
Liu, Honglei
Wei, Lan
Fei, Xiaolu
Chen, Hui
author_facet Wang, Ni
Wang, Muyu
Zhou, Yang
Liu, Honglei
Wei, Lan
Fei, Xiaolu
Chen, Hui
author_sort Wang, Ni
collection PubMed
description BACKGROUND: Sequential information in electronic medical records is valuable and helpful for patient outcome prediction but is rarely used for patient similarity measurement because of its unevenness, irregularity, and heterogeneity. OBJECTIVE: We aimed to develop a patient similarity framework for patient outcome prediction that makes use of sequential and cross-sectional information in electronic medical record systems. METHODS: Sequence similarity was calculated from timestamped event sequences using edit distance, and trend similarity was calculated from time series using dynamic time warping and Haar decomposition. We also extracted cross-sectional information, namely, demographic, laboratory test, and radiological report data, for additional similarity calculations. We validated the effectiveness of the framework by constructing k–nearest neighbors classifiers to predict mortality and readmission for acute myocardial infarction patients, using data from (1) a public data set and (2) a private data set, at 3 time points—at admission, on Day 7, and at discharge—to provide early warning patient outcomes. We also constructed state-of-the-art Euclidean-distance k–nearest neighbor, logistic regression, random forest, long short-term memory network, and recurrent neural network models, which were used for comparison. RESULTS: With all available information during a hospitalization episode, predictive models using the similarity model outperformed baseline models based on both public and private data sets. For mortality predictions, all models except for the logistic regression model showed improved performances over time. There were no such increasing trends in predictive performances for readmission predictions. The random forest and logistic regression models performed best for mortality and readmission predictions, respectively, when using information from the first week after admission. CONCLUSIONS: For patient outcome predictions, the patient similarity framework facilitated sequential similarity calculations for uneven electronic medical record data and helped improve predictive performance.
format Online
Article
Text
id pubmed-8778569
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-87785692022-02-03 Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development Wang, Ni Wang, Muyu Zhou, Yang Liu, Honglei Wei, Lan Fei, Xiaolu Chen, Hui J Med Internet Res Original Paper BACKGROUND: Sequential information in electronic medical records is valuable and helpful for patient outcome prediction but is rarely used for patient similarity measurement because of its unevenness, irregularity, and heterogeneity. OBJECTIVE: We aimed to develop a patient similarity framework for patient outcome prediction that makes use of sequential and cross-sectional information in electronic medical record systems. METHODS: Sequence similarity was calculated from timestamped event sequences using edit distance, and trend similarity was calculated from time series using dynamic time warping and Haar decomposition. We also extracted cross-sectional information, namely, demographic, laboratory test, and radiological report data, for additional similarity calculations. We validated the effectiveness of the framework by constructing k–nearest neighbors classifiers to predict mortality and readmission for acute myocardial infarction patients, using data from (1) a public data set and (2) a private data set, at 3 time points—at admission, on Day 7, and at discharge—to provide early warning patient outcomes. We also constructed state-of-the-art Euclidean-distance k–nearest neighbor, logistic regression, random forest, long short-term memory network, and recurrent neural network models, which were used for comparison. RESULTS: With all available information during a hospitalization episode, predictive models using the similarity model outperformed baseline models based on both public and private data sets. For mortality predictions, all models except for the logistic regression model showed improved performances over time. There were no such increasing trends in predictive performances for readmission predictions. The random forest and logistic regression models performed best for mortality and readmission predictions, respectively, when using information from the first week after admission. CONCLUSIONS: For patient outcome predictions, the patient similarity framework facilitated sequential similarity calculations for uneven electronic medical record data and helped improve predictive performance. JMIR Publications 2022-01-06 /pmc/articles/PMC8778569/ /pubmed/34989682 http://dx.doi.org/10.2196/30720 Text en ©Ni Wang, Muyu Wang, Yang Zhou, Honglei Liu, Lan Wei, Xiaolu Fei, Hui Chen. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 06.01.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Wang, Ni
Wang, Muyu
Zhou, Yang
Liu, Honglei
Wei, Lan
Fei, Xiaolu
Chen, Hui
Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development
title Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development
title_full Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development
title_fullStr Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development
title_full_unstemmed Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development
title_short Sequential Data–Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development
title_sort sequential data–based patient similarity framework for patient outcome prediction: algorithm development
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8778569/
https://www.ncbi.nlm.nih.gov/pubmed/34989682
http://dx.doi.org/10.2196/30720
work_keys_str_mv AT wangni sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment
AT wangmuyu sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment
AT zhouyang sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment
AT liuhonglei sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment
AT weilan sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment
AT feixiaolu sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment
AT chenhui sequentialdatabasedpatientsimilarityframeworkforpatientoutcomepredictionalgorithmdevelopment