Cargando…
Analytical methods for identifying sequences of utilization in health data: a scoping review
BACKGROUND: Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10523647/ https://www.ncbi.nlm.nih.gov/pubmed/37759162 http://dx.doi.org/10.1186/s12874-023-02019-y |
_version_ | 1785110604115083264 |
---|---|
author | Flothow, Amelie Novelli, Anna Sundmacher, Leonie |
author_facet | Flothow, Amelie Novelli, Anna Sundmacher, Leonie |
author_sort | Flothow, Amelie |
collection | PubMed |
description | BACKGROUND: Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This scoping review provides an overview of the algorithms and methods that have been used to identify care pathways from healthcare utilization data. METHOD: This review was conducted according to the methodology of the Joanna Briggs Institute and the Preferred Reporting Items for Systematic Reviews Extension for Scoping Reviews (PRISMA-ScR) Checklist. The PubMed, Web of Science, Scopus, and EconLit databases were searched and studies published in English between 2000 and 2021 considered. The search strategy used keywords divided into three categories: the method of data analysis, the requirement profile for the data, and the intended presentation of results. Criteria for inclusion were that health data were analyzed, the methodology used was described and that the chronology of care events was considered. In a two-stage review process, records were reviewed by two researchers independently for inclusion. Results were synthesized narratively. RESULTS: The literature search yielded 2,865 entries; 51 studies met the inclusion criteria. Health data from different countries ([Formula: see text] ) and of different types of disease ([Formula: see text] ) were analyzed with respect to different care events. Applied methods can be divided into those identifying subsequences of care and those describing full care trajectories. Variants of pattern mining or Markov models were mostly used to extract subsequences, with clustering often applied to find care trajectories. Statistical algorithms such as rule mining, probability-based machine learning algorithms or a combination of methods were also applied. Clustering methods were sometimes used for data preparation or result compression. Further characteristics of the included studies are presented. CONCLUSION: Various data mining methods are already being applied to gain insight from health data. The great heterogeneity of the methods used shows the need for a scoping review. We performed a narrative review and found that clustering methods currently dominate the literature for identifying complete care trajectories, while variants of pattern mining dominate for identifying subsequences of limited length. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-023-02019-y. |
format | Online Article Text |
id | pubmed-10523647 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105236472023-09-28 Analytical methods for identifying sequences of utilization in health data: a scoping review Flothow, Amelie Novelli, Anna Sundmacher, Leonie BMC Med Res Methodol Research Article BACKGROUND: Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This scoping review provides an overview of the algorithms and methods that have been used to identify care pathways from healthcare utilization data. METHOD: This review was conducted according to the methodology of the Joanna Briggs Institute and the Preferred Reporting Items for Systematic Reviews Extension for Scoping Reviews (PRISMA-ScR) Checklist. The PubMed, Web of Science, Scopus, and EconLit databases were searched and studies published in English between 2000 and 2021 considered. The search strategy used keywords divided into three categories: the method of data analysis, the requirement profile for the data, and the intended presentation of results. Criteria for inclusion were that health data were analyzed, the methodology used was described and that the chronology of care events was considered. In a two-stage review process, records were reviewed by two researchers independently for inclusion. Results were synthesized narratively. RESULTS: The literature search yielded 2,865 entries; 51 studies met the inclusion criteria. Health data from different countries ([Formula: see text] ) and of different types of disease ([Formula: see text] ) were analyzed with respect to different care events. Applied methods can be divided into those identifying subsequences of care and those describing full care trajectories. Variants of pattern mining or Markov models were mostly used to extract subsequences, with clustering often applied to find care trajectories. Statistical algorithms such as rule mining, probability-based machine learning algorithms or a combination of methods were also applied. Clustering methods were sometimes used for data preparation or result compression. Further characteristics of the included studies are presented. CONCLUSION: Various data mining methods are already being applied to gain insight from health data. The great heterogeneity of the methods used shows the need for a scoping review. We performed a narrative review and found that clustering methods currently dominate the literature for identifying complete care trajectories, while variants of pattern mining dominate for identifying subsequences of limited length. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-023-02019-y. BioMed Central 2023-09-27 /pmc/articles/PMC10523647/ /pubmed/37759162 http://dx.doi.org/10.1186/s12874-023-02019-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Flothow, Amelie Novelli, Anna Sundmacher, Leonie Analytical methods for identifying sequences of utilization in health data: a scoping review |
title | Analytical methods for identifying sequences of utilization in health data: a scoping review |
title_full | Analytical methods for identifying sequences of utilization in health data: a scoping review |
title_fullStr | Analytical methods for identifying sequences of utilization in health data: a scoping review |
title_full_unstemmed | Analytical methods for identifying sequences of utilization in health data: a scoping review |
title_short | Analytical methods for identifying sequences of utilization in health data: a scoping review |
title_sort | analytical methods for identifying sequences of utilization in health data: a scoping review |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10523647/ https://www.ncbi.nlm.nih.gov/pubmed/37759162 http://dx.doi.org/10.1186/s12874-023-02019-y |
work_keys_str_mv | AT flothowamelie analyticalmethodsforidentifyingsequencesofutilizationinhealthdataascopingreview AT novellianna analyticalmethodsforidentifyingsequencesofutilizationinhealthdataascopingreview AT sundmacherleonie analyticalmethodsforidentifyingsequencesofutilizationinhealthdataascopingreview |