Cargando…

Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models

The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for m...

Descripción completa

Detalles Bibliográficos
Autores principales: Dolan, Elizabeth, Goulding, James, Marshall, Harry, Smith, Gavin, Long, Gavin, Tata, Laila J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663456/
https://www.ncbi.nlm.nih.gov/pubmed/37990023
http://dx.doi.org/10.1038/s41467-023-42776-4
_version_ 1785148639756156928
author Dolan, Elizabeth
Goulding, James
Marshall, Harry
Smith, Gavin
Long, Gavin
Tata, Laila J.
author_facet Dolan, Elizabeth
Goulding, James
Marshall, Harry
Smith, Gavin
Long, Gavin
Tata, Laila J.
author_sort Dolan, Elizabeth
collection PubMed
description The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms, to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, by using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data and focused on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly out-perform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R(2) (coefficient of determination) between 0.09 to 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity.
format Online
Article
Text
id pubmed-10663456
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106634562023-11-21 Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models Dolan, Elizabeth Goulding, James Marshall, Harry Smith, Gavin Long, Gavin Tata, Laila J. Nat Commun Article The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms, to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, by using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data and focused on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly out-perform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R(2) (coefficient of determination) between 0.09 to 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity. Nature Publishing Group UK 2023-11-21 /pmc/articles/PMC10663456/ /pubmed/37990023 http://dx.doi.org/10.1038/s41467-023-42776-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Dolan, Elizabeth
Goulding, James
Marshall, Harry
Smith, Gavin
Long, Gavin
Tata, Laila J.
Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
title Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
title_full Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
title_fullStr Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
title_full_unstemmed Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
title_short Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
title_sort assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10663456/
https://www.ncbi.nlm.nih.gov/pubmed/37990023
http://dx.doi.org/10.1038/s41467-023-42776-4
work_keys_str_mv AT dolanelizabeth assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels
AT gouldingjames assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels
AT marshallharry assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels
AT smithgavin assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels
AT longgavin assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels
AT tatalailaj assessingthevalueofintegratingnationallongitudinalshoppingdataintorespiratorydiseaseforecastingmodels