Cargando…

Analytical methods for identifying sequences of utilization in health data: a scoping review

BACKGROUND: Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This...

Descripción completa

Detalles Bibliográficos
Autores principales: Flothow, Amelie, Novelli, Anna, Sundmacher, Leonie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10523647/
https://www.ncbi.nlm.nih.gov/pubmed/37759162
http://dx.doi.org/10.1186/s12874-023-02019-y
_version_ 1785110604115083264
author Flothow, Amelie
Novelli, Anna
Sundmacher, Leonie
author_facet Flothow, Amelie
Novelli, Anna
Sundmacher, Leonie
author_sort Flothow, Amelie
collection PubMed
description BACKGROUND: Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This scoping review provides an overview of the algorithms and methods that have been used to identify care pathways from healthcare utilization data. METHOD: This review was conducted according to the methodology of the Joanna Briggs Institute and the Preferred Reporting Items for Systematic Reviews Extension for Scoping Reviews (PRISMA-ScR) Checklist. The PubMed, Web of Science, Scopus, and EconLit databases were searched and studies published in English between 2000 and 2021 considered. The search strategy used keywords divided into three categories: the method of data analysis, the requirement profile for the data, and the intended presentation of results. Criteria for inclusion were that health data were analyzed, the methodology used was described and that the chronology of care events was considered. In a two-stage review process, records were reviewed by two researchers independently for inclusion. Results were synthesized narratively. RESULTS: The literature search yielded 2,865 entries; 51 studies met the inclusion criteria. Health data from different countries ([Formula: see text] ) and of different types of disease ([Formula: see text] ) were analyzed with respect to different care events. Applied methods can be divided into those identifying subsequences of care and those describing full care trajectories. Variants of pattern mining or Markov models were mostly used to extract subsequences, with clustering often applied to find care trajectories. Statistical algorithms such as rule mining, probability-based machine learning algorithms or a combination of methods were also applied. Clustering methods were sometimes used for data preparation or result compression. Further characteristics of the included studies are presented. CONCLUSION: Various data mining methods are already being applied to gain insight from health data. The great heterogeneity of the methods used shows the need for a scoping review. We performed a narrative review and found that clustering methods currently dominate the literature for identifying complete care trajectories, while variants of pattern mining dominate for identifying subsequences of limited length. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-023-02019-y.
format Online
Article
Text
id pubmed-10523647
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105236472023-09-28 Analytical methods for identifying sequences of utilization in health data: a scoping review Flothow, Amelie Novelli, Anna Sundmacher, Leonie BMC Med Res Methodol Research Article BACKGROUND: Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This scoping review provides an overview of the algorithms and methods that have been used to identify care pathways from healthcare utilization data. METHOD: This review was conducted according to the methodology of the Joanna Briggs Institute and the Preferred Reporting Items for Systematic Reviews Extension for Scoping Reviews (PRISMA-ScR) Checklist. The PubMed, Web of Science, Scopus, and EconLit databases were searched and studies published in English between 2000 and 2021 considered. The search strategy used keywords divided into three categories: the method of data analysis, the requirement profile for the data, and the intended presentation of results. Criteria for inclusion were that health data were analyzed, the methodology used was described and that the chronology of care events was considered. In a two-stage review process, records were reviewed by two researchers independently for inclusion. Results were synthesized narratively. RESULTS: The literature search yielded 2,865 entries; 51 studies met the inclusion criteria. Health data from different countries ([Formula: see text] ) and of different types of disease ([Formula: see text] ) were analyzed with respect to different care events. Applied methods can be divided into those identifying subsequences of care and those describing full care trajectories. Variants of pattern mining or Markov models were mostly used to extract subsequences, with clustering often applied to find care trajectories. Statistical algorithms such as rule mining, probability-based machine learning algorithms or a combination of methods were also applied. Clustering methods were sometimes used for data preparation or result compression. Further characteristics of the included studies are presented. CONCLUSION: Various data mining methods are already being applied to gain insight from health data. The great heterogeneity of the methods used shows the need for a scoping review. We performed a narrative review and found that clustering methods currently dominate the literature for identifying complete care trajectories, while variants of pattern mining dominate for identifying subsequences of limited length. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-023-02019-y. BioMed Central 2023-09-27 /pmc/articles/PMC10523647/ /pubmed/37759162 http://dx.doi.org/10.1186/s12874-023-02019-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Flothow, Amelie
Novelli, Anna
Sundmacher, Leonie
Analytical methods for identifying sequences of utilization in health data: a scoping review
title Analytical methods for identifying sequences of utilization in health data: a scoping review
title_full Analytical methods for identifying sequences of utilization in health data: a scoping review
title_fullStr Analytical methods for identifying sequences of utilization in health data: a scoping review
title_full_unstemmed Analytical methods for identifying sequences of utilization in health data: a scoping review
title_short Analytical methods for identifying sequences of utilization in health data: a scoping review
title_sort analytical methods for identifying sequences of utilization in health data: a scoping review
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10523647/
https://www.ncbi.nlm.nih.gov/pubmed/37759162
http://dx.doi.org/10.1186/s12874-023-02019-y
work_keys_str_mv AT flothowamelie analyticalmethodsforidentifyingsequencesofutilizationinhealthdataascopingreview
AT novellianna analyticalmethodsforidentifyingsequencesofutilizationinhealthdataascopingreview
AT sundmacherleonie analyticalmethodsforidentifyingsequencesofutilizationinhealthdataascopingreview