Cargando…
Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for m...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663456/ https://www.ncbi.nlm.nih.gov/pubmed/37990023 http://dx.doi.org/10.1038/s41467-023-42776-4 |
_version_ | 1785148639756156928 |
---|---|
author | Dolan, Elizabeth Goulding, James Marshall, Harry Smith, Gavin Long, Gavin Tata, Laila J. |
author_facet | Dolan, Elizabeth Goulding, James Marshall, Harry Smith, Gavin Long, Gavin Tata, Laila J. |
author_sort | Dolan, Elizabeth |
collection | PubMed |
description | The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms, to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, by using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data and focused on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly out-perform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R(2) (coefficient of determination) between 0.09 to 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity. |
format | Online Article Text |
id | pubmed-10663456 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106634562023-11-21 Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models Dolan, Elizabeth Goulding, James Marshall, Harry Smith, Gavin Long, Gavin Tata, Laila J. Nat Commun Article The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms, to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, by using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data and focused on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly out-perform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R(2) (coefficient of determination) between 0.09 to 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity. Nature Publishing Group UK 2023-11-21 /pmc/articles/PMC10663456/ /pubmed/37990023 http://dx.doi.org/10.1038/s41467-023-42776-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Dolan, Elizabeth Goulding, James Marshall, Harry Smith, Gavin Long, Gavin Tata, Laila J. Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
title | Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
title_full | Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
title_fullStr | Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
title_full_unstemmed | Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
title_short | Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
title_sort | assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663456/ https://www.ncbi.nlm.nih.gov/pubmed/37990023 http://dx.doi.org/10.1038/s41467-023-42776-4 |
work_keys_str_mv | AT dolanelizabeth assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels AT gouldingjames assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels AT marshallharry assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels AT smithgavin assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels AT longgavin assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels AT tatalailaj assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels |