Cargando…

Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation

Long-term future prediction of geographic areas with high rates of potentially preventable hospitalisations (PPHs) among residents, or “hotspots”, is critical to ensure the effective location of place-based health service interventions. This is because such interventions are typically expensive and...

Descripción completa

Detalles Bibliográficos
Autores principales: Tuson, Matthew, Turlach, Berwin, Murray, Kevin, Kok, Mei Ruu, Vickery, Alistair, Whyatt, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8508485/
https://www.ncbi.nlm.nih.gov/pubmed/34639555
http://dx.doi.org/10.3390/ijerph181910253
_version_ 1784582108742680576
author Tuson, Matthew
Turlach, Berwin
Murray, Kevin
Kok, Mei Ruu
Vickery, Alistair
Whyatt, David
author_facet Tuson, Matthew
Turlach, Berwin
Murray, Kevin
Kok, Mei Ruu
Vickery, Alistair
Whyatt, David
author_sort Tuson, Matthew
collection PubMed
description Long-term future prediction of geographic areas with high rates of potentially preventable hospitalisations (PPHs) among residents, or “hotspots”, is critical to ensure the effective location of place-based health service interventions. This is because such interventions are typically expensive and take time to develop, implement, and take effect, and hotspots often regress to the mean. Using spatially aggregated, longitudinal administrative health data, we introduce a method to make such predictions. The proposed method combines all subset model selection with a novel formulation of repeated k-fold cross-validation in developing optimal models. We illustrate its application predicting three-year future hotspots for four PPHs in an Australian context: type II diabetes mellitus, heart failure, chronic obstructive pulmonary disease, and “high risk foot”. In these examples, optimal models are selected through maximising positive predictive value while maintaining sensitivity above a user-specified minimum threshold. We compare the model’s performance to that of two alternative methods commonly used in practice, i.e., prediction of future hotspots based on either: (i) current hotspots, or (ii) past persistent hotspots. In doing so, we demonstrate favourable performance of our method, including with respect to its ability to flexibly optimise various different metrics. Accordingly, we suggest that our method might effectively be used to assist health planners predict excess future demand of health services and prioritise placement of interventions. Furthermore, it could be used to predict future hotspots of non-health events, e.g., in criminology.
format Online
Article
Text
id pubmed-8508485
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85084852021-10-13 Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation Tuson, Matthew Turlach, Berwin Murray, Kevin Kok, Mei Ruu Vickery, Alistair Whyatt, David Int J Environ Res Public Health Article Long-term future prediction of geographic areas with high rates of potentially preventable hospitalisations (PPHs) among residents, or “hotspots”, is critical to ensure the effective location of place-based health service interventions. This is because such interventions are typically expensive and take time to develop, implement, and take effect, and hotspots often regress to the mean. Using spatially aggregated, longitudinal administrative health data, we introduce a method to make such predictions. The proposed method combines all subset model selection with a novel formulation of repeated k-fold cross-validation in developing optimal models. We illustrate its application predicting three-year future hotspots for four PPHs in an Australian context: type II diabetes mellitus, heart failure, chronic obstructive pulmonary disease, and “high risk foot”. In these examples, optimal models are selected through maximising positive predictive value while maintaining sensitivity above a user-specified minimum threshold. We compare the model’s performance to that of two alternative methods commonly used in practice, i.e., prediction of future hotspots based on either: (i) current hotspots, or (ii) past persistent hotspots. In doing so, we demonstrate favourable performance of our method, including with respect to its ability to flexibly optimise various different metrics. Accordingly, we suggest that our method might effectively be used to assist health planners predict excess future demand of health services and prioritise placement of interventions. Furthermore, it could be used to predict future hotspots of non-health events, e.g., in criminology. MDPI 2021-09-29 /pmc/articles/PMC8508485/ /pubmed/34639555 http://dx.doi.org/10.3390/ijerph181910253 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tuson, Matthew
Turlach, Berwin
Murray, Kevin
Kok, Mei Ruu
Vickery, Alistair
Whyatt, David
Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
title Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
title_full Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
title_fullStr Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
title_full_unstemmed Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
title_short Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
title_sort predicting future geographic hotspots of potentially preventable hospitalisations using all subset model selection and repeated k-fold cross-validation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8508485/
https://www.ncbi.nlm.nih.gov/pubmed/34639555
http://dx.doi.org/10.3390/ijerph181910253
work_keys_str_mv AT tusonmatthew predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation
AT turlachberwin predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation
AT murraykevin predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation
AT kokmeiruu predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation
AT vickeryalistair predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation
AT whyattdavid predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation