Cargando…
Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation
Long-term future prediction of geographic areas with high rates of potentially preventable hospitalisations (PPHs) among residents, or “hotspots”, is critical to ensure the effective location of place-based health service interventions. This is because such interventions are typically expensive and...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8508485/ https://www.ncbi.nlm.nih.gov/pubmed/34639555 http://dx.doi.org/10.3390/ijerph181910253 |
_version_ | 1784582108742680576 |
---|---|
author | Tuson, Matthew Turlach, Berwin Murray, Kevin Kok, Mei Ruu Vickery, Alistair Whyatt, David |
author_facet | Tuson, Matthew Turlach, Berwin Murray, Kevin Kok, Mei Ruu Vickery, Alistair Whyatt, David |
author_sort | Tuson, Matthew |
collection | PubMed |
description | Long-term future prediction of geographic areas with high rates of potentially preventable hospitalisations (PPHs) among residents, or “hotspots”, is critical to ensure the effective location of place-based health service interventions. This is because such interventions are typically expensive and take time to develop, implement, and take effect, and hotspots often regress to the mean. Using spatially aggregated, longitudinal administrative health data, we introduce a method to make such predictions. The proposed method combines all subset model selection with a novel formulation of repeated k-fold cross-validation in developing optimal models. We illustrate its application predicting three-year future hotspots for four PPHs in an Australian context: type II diabetes mellitus, heart failure, chronic obstructive pulmonary disease, and “high risk foot”. In these examples, optimal models are selected through maximising positive predictive value while maintaining sensitivity above a user-specified minimum threshold. We compare the model’s performance to that of two alternative methods commonly used in practice, i.e., prediction of future hotspots based on either: (i) current hotspots, or (ii) past persistent hotspots. In doing so, we demonstrate favourable performance of our method, including with respect to its ability to flexibly optimise various different metrics. Accordingly, we suggest that our method might effectively be used to assist health planners predict excess future demand of health services and prioritise placement of interventions. Furthermore, it could be used to predict future hotspots of non-health events, e.g., in criminology. |
format | Online Article Text |
id | pubmed-8508485 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85084852021-10-13 Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation Tuson, Matthew Turlach, Berwin Murray, Kevin Kok, Mei Ruu Vickery, Alistair Whyatt, David Int J Environ Res Public Health Article Long-term future prediction of geographic areas with high rates of potentially preventable hospitalisations (PPHs) among residents, or “hotspots”, is critical to ensure the effective location of place-based health service interventions. This is because such interventions are typically expensive and take time to develop, implement, and take effect, and hotspots often regress to the mean. Using spatially aggregated, longitudinal administrative health data, we introduce a method to make such predictions. The proposed method combines all subset model selection with a novel formulation of repeated k-fold cross-validation in developing optimal models. We illustrate its application predicting three-year future hotspots for four PPHs in an Australian context: type II diabetes mellitus, heart failure, chronic obstructive pulmonary disease, and “high risk foot”. In these examples, optimal models are selected through maximising positive predictive value while maintaining sensitivity above a user-specified minimum threshold. We compare the model’s performance to that of two alternative methods commonly used in practice, i.e., prediction of future hotspots based on either: (i) current hotspots, or (ii) past persistent hotspots. In doing so, we demonstrate favourable performance of our method, including with respect to its ability to flexibly optimise various different metrics. Accordingly, we suggest that our method might effectively be used to assist health planners predict excess future demand of health services and prioritise placement of interventions. Furthermore, it could be used to predict future hotspots of non-health events, e.g., in criminology. MDPI 2021-09-29 /pmc/articles/PMC8508485/ /pubmed/34639555 http://dx.doi.org/10.3390/ijerph181910253 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tuson, Matthew Turlach, Berwin Murray, Kevin Kok, Mei Ruu Vickery, Alistair Whyatt, David Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation |
title | Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation |
title_full | Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation |
title_fullStr | Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation |
title_full_unstemmed | Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation |
title_short | Predicting Future Geographic Hotspots of Potentially Preventable Hospitalisations Using All Subset Model Selection and Repeated K-Fold Cross-Validation |
title_sort | predicting future geographic hotspots of potentially preventable hospitalisations using all subset model selection and repeated k-fold cross-validation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8508485/ https://www.ncbi.nlm.nih.gov/pubmed/34639555 http://dx.doi.org/10.3390/ijerph181910253 |
work_keys_str_mv | AT tusonmatthew predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation AT turlachberwin predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation AT murraykevin predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation AT kokmeiruu predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation AT vickeryalistair predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation AT whyattdavid predictingfuturegeographichotspotsofpotentiallypreventablehospitalisationsusingallsubsetmodelselectionandrepeatedkfoldcrossvalidation |