Cargando…

Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study

BACKGROUND: Hospital patient registries provide substantial longitudinal data sets describing the clinical and medical health statuses of inpatients and their pharmacological prescriptions. Despite the multiple advantages of routinely collecting multidimensional longitudinal data, those data sets ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Taushanov, Zhivko, Verloo, Henk, Wernli, Boris, Di Giovanni, Saviana, von Gunten, Armin, Pereira, Filipa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150425/
https://www.ncbi.nlm.nih.gov/pubmed/33973865
http://dx.doi.org/10.2196/24205
_version_ 1783698149090525184
author Taushanov, Zhivko
Verloo, Henk
Wernli, Boris
Di Giovanni, Saviana
von Gunten, Armin
Pereira, Filipa
author_facet Taushanov, Zhivko
Verloo, Henk
Wernli, Boris
Di Giovanni, Saviana
von Gunten, Armin
Pereira, Filipa
author_sort Taushanov, Zhivko
collection PubMed
description BACKGROUND: Hospital patient registries provide substantial longitudinal data sets describing the clinical and medical health statuses of inpatients and their pharmacological prescriptions. Despite the multiple advantages of routinely collecting multidimensional longitudinal data, those data sets are rarely suitable for advanced statistical analysis and they require customization and synthesis. OBJECTIVE: The aim of this study was to describe the methods used to transform and synthesize a raw, multidimensional, hospital patient registry data set into an exploitable database for the further investigation of risk profiles and predictive and survival health outcomes among polymorbid, polymedicated, older inpatients in relation to their medicine prescriptions at hospital discharge. METHODS: A raw, multidimensional data set from a public hospital was extracted from the hospital registry in a CSV (.csv) file and imported into the R statistical package for cleaning, customization, and synthesis. Patients fulfilling the criteria for inclusion were home-dwelling, polymedicated, older adults with multiple chronic conditions aged ≥65 who became hospitalized. The patient data set covered 140 variables from 20,422 hospitalizations of polymedicated, home-dwelling older adults from 2015 to 2018. Each variable, according to type, was explored and computed to describe distributions, missing values, and associations. Different clustering methods, expert opinion, recoding, and missing-value techniques were used to customize and synthesize these multidimensional data sets. RESULTS: Sociodemographic data showed no missing values. Average age, hospital length of stay, and frequency of hospitalization were computed. Discharge details were recoded and summarized. Clinical data were cleaned up and best practices for managing missing values were applied. Seven clusters of medical diagnoses, surgical interventions, somatic, cognitive, and medicines data were extracted using empirical and statistical best practices, with each presenting the health status of the patients included in it as accurately as possible. Medical, comorbidity, and drug data were recoded and summarized. CONCLUSIONS: A cleaner, better-structured data set was obtained, combining empirical and best-practice statistical approaches. The overall strategy delivered an exploitable, population-based database suitable for an advanced analysis of the descriptive, predictive, and survival statistics relating to polymedicated, home-dwelling older adults admitted as inpatients. More research is needed to develop best practices for customizing and synthesizing large, multidimensional, population-based registries. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.1136/bmjopen-2019-030030
format Online
Article
Text
id pubmed-8150425
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81504252021-06-11 Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study Taushanov, Zhivko Verloo, Henk Wernli, Boris Di Giovanni, Saviana von Gunten, Armin Pereira, Filipa JMIR Med Inform Original Paper BACKGROUND: Hospital patient registries provide substantial longitudinal data sets describing the clinical and medical health statuses of inpatients and their pharmacological prescriptions. Despite the multiple advantages of routinely collecting multidimensional longitudinal data, those data sets are rarely suitable for advanced statistical analysis and they require customization and synthesis. OBJECTIVE: The aim of this study was to describe the methods used to transform and synthesize a raw, multidimensional, hospital patient registry data set into an exploitable database for the further investigation of risk profiles and predictive and survival health outcomes among polymorbid, polymedicated, older inpatients in relation to their medicine prescriptions at hospital discharge. METHODS: A raw, multidimensional data set from a public hospital was extracted from the hospital registry in a CSV (.csv) file and imported into the R statistical package for cleaning, customization, and synthesis. Patients fulfilling the criteria for inclusion were home-dwelling, polymedicated, older adults with multiple chronic conditions aged ≥65 who became hospitalized. The patient data set covered 140 variables from 20,422 hospitalizations of polymedicated, home-dwelling older adults from 2015 to 2018. Each variable, according to type, was explored and computed to describe distributions, missing values, and associations. Different clustering methods, expert opinion, recoding, and missing-value techniques were used to customize and synthesize these multidimensional data sets. RESULTS: Sociodemographic data showed no missing values. Average age, hospital length of stay, and frequency of hospitalization were computed. Discharge details were recoded and summarized. Clinical data were cleaned up and best practices for managing missing values were applied. Seven clusters of medical diagnoses, surgical interventions, somatic, cognitive, and medicines data were extracted using empirical and statistical best practices, with each presenting the health status of the patients included in it as accurately as possible. Medical, comorbidity, and drug data were recoded and summarized. CONCLUSIONS: A cleaner, better-structured data set was obtained, combining empirical and best-practice statistical approaches. The overall strategy delivered an exploitable, population-based database suitable for an advanced analysis of the descriptive, predictive, and survival statistics relating to polymedicated, home-dwelling older adults admitted as inpatients. More research is needed to develop best practices for customizing and synthesizing large, multidimensional, population-based registries. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.1136/bmjopen-2019-030030 JMIR Publications 2021-05-11 /pmc/articles/PMC8150425/ /pubmed/33973865 http://dx.doi.org/10.2196/24205 Text en ©Zhivko Taushanov, Henk Verloo, Boris Wernli, Saviana Di Giovanni, Armin von Gunten, Filipa Pereira. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.05.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Taushanov, Zhivko
Verloo, Henk
Wernli, Boris
Di Giovanni, Saviana
von Gunten, Armin
Pereira, Filipa
Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study
title Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study
title_full Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study
title_fullStr Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study
title_full_unstemmed Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study
title_short Transforming a Patient Registry Into a Customized Data Set for the Advanced Statistical Analysis of Health Risk Factors and for Medication-Related Hospitalization Research: Retrospective Hospital Patient Registry Study
title_sort transforming a patient registry into a customized data set for the advanced statistical analysis of health risk factors and for medication-related hospitalization research: retrospective hospital patient registry study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150425/
https://www.ncbi.nlm.nih.gov/pubmed/33973865
http://dx.doi.org/10.2196/24205
work_keys_str_mv AT taushanovzhivko transformingapatientregistryintoacustomizeddatasetfortheadvancedstatisticalanalysisofhealthriskfactorsandformedicationrelatedhospitalizationresearchretrospectivehospitalpatientregistrystudy
AT verloohenk transformingapatientregistryintoacustomizeddatasetfortheadvancedstatisticalanalysisofhealthriskfactorsandformedicationrelatedhospitalizationresearchretrospectivehospitalpatientregistrystudy
AT wernliboris transformingapatientregistryintoacustomizeddatasetfortheadvancedstatisticalanalysisofhealthriskfactorsandformedicationrelatedhospitalizationresearchretrospectivehospitalpatientregistrystudy
AT digiovannisaviana transformingapatientregistryintoacustomizeddatasetfortheadvancedstatisticalanalysisofhealthriskfactorsandformedicationrelatedhospitalizationresearchretrospectivehospitalpatientregistrystudy
AT vonguntenarmin transformingapatientregistryintoacustomizeddatasetfortheadvancedstatisticalanalysisofhealthriskfactorsandformedicationrelatedhospitalizationresearchretrospectivehospitalpatientregistrystudy
AT pereirafilipa transformingapatientregistryintoacustomizeddatasetfortheadvancedstatisticalanalysisofhealthriskfactorsandformedicationrelatedhospitalizationresearchretrospectivehospitalpatientregistrystudy