Cargando…
Modified Needleman–Wunsch algorithm for clinical pathway clustering
Clinical pathways are used to guide clinicians to provide a standardised delivery of care. Because of their standardisation, the aim of clinical pathways is to reduce variation in both care process and patient outcomes. When learning clinical pathways from data through data mining, it is common prac...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7973729/ https://www.ncbi.nlm.nih.gov/pubmed/33359110 http://dx.doi.org/10.1016/j.jbi.2020.103668 |
_version_ | 1783666882154332160 |
---|---|
author | Aspland, Emma Harper, Paul R. Gartner, Daniel Webb, Philip Barrett-Lee, Peter |
author_facet | Aspland, Emma Harper, Paul R. Gartner, Daniel Webb, Philip Barrett-Lee, Peter |
author_sort | Aspland, Emma |
collection | PubMed |
description | Clinical pathways are used to guide clinicians to provide a standardised delivery of care. Because of their standardisation, the aim of clinical pathways is to reduce variation in both care process and patient outcomes. When learning clinical pathways from data through data mining, it is common practice to represent each patient pathway as a string corresponding to their movements through activities. Clustering techniques are popular methods for pathway mining, and therefore this paper focuses on distance metrics applied to string data for k-medoids clustering. The two main aims are to firstly, develop a technique that seamlessly integrates expert information with data and secondly, to develop a string distance metric for the purpose of process data. The overall goal was to allow for more meaningful clustering results to be found by adding context into the string similarity calculation. Eight common distance metrics and their applicability are discussed. These distance metrics prove to give an arbitrary distance, without consideration for context, and each produce different results. As a result, this paper describes the development of a new distance metric, the modified Needleman–Wunsch algorithm, that allows for expert interaction with the calculation by assigning groupings and rankings to activities, which provide context to the strings. This algorithm has been developed in partnership with UK’s National Health Service (NHS) with the focus on a lung cancer pathway, however the handling of the data and algorithm allows for application to any disease type. This method is contained within Sim.Pro.Flow, a publicly available decision support tool. |
format | Online Article Text |
id | pubmed-7973729 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-79737292021-03-23 Modified Needleman–Wunsch algorithm for clinical pathway clustering Aspland, Emma Harper, Paul R. Gartner, Daniel Webb, Philip Barrett-Lee, Peter J Biomed Inform Original Research Clinical pathways are used to guide clinicians to provide a standardised delivery of care. Because of their standardisation, the aim of clinical pathways is to reduce variation in both care process and patient outcomes. When learning clinical pathways from data through data mining, it is common practice to represent each patient pathway as a string corresponding to their movements through activities. Clustering techniques are popular methods for pathway mining, and therefore this paper focuses on distance metrics applied to string data for k-medoids clustering. The two main aims are to firstly, develop a technique that seamlessly integrates expert information with data and secondly, to develop a string distance metric for the purpose of process data. The overall goal was to allow for more meaningful clustering results to be found by adding context into the string similarity calculation. Eight common distance metrics and their applicability are discussed. These distance metrics prove to give an arbitrary distance, without consideration for context, and each produce different results. As a result, this paper describes the development of a new distance metric, the modified Needleman–Wunsch algorithm, that allows for expert interaction with the calculation by assigning groupings and rankings to activities, which provide context to the strings. This algorithm has been developed in partnership with UK’s National Health Service (NHS) with the focus on a lung cancer pathway, however the handling of the data and algorithm allows for application to any disease type. This method is contained within Sim.Pro.Flow, a publicly available decision support tool. Elsevier 2021-03 /pmc/articles/PMC7973729/ /pubmed/33359110 http://dx.doi.org/10.1016/j.jbi.2020.103668 Text en © 2021 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Original Research Aspland, Emma Harper, Paul R. Gartner, Daniel Webb, Philip Barrett-Lee, Peter Modified Needleman–Wunsch algorithm for clinical pathway clustering |
title | Modified Needleman–Wunsch algorithm for clinical pathway clustering |
title_full | Modified Needleman–Wunsch algorithm for clinical pathway clustering |
title_fullStr | Modified Needleman–Wunsch algorithm for clinical pathway clustering |
title_full_unstemmed | Modified Needleman–Wunsch algorithm for clinical pathway clustering |
title_short | Modified Needleman–Wunsch algorithm for clinical pathway clustering |
title_sort | modified needleman–wunsch algorithm for clinical pathway clustering |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7973729/ https://www.ncbi.nlm.nih.gov/pubmed/33359110 http://dx.doi.org/10.1016/j.jbi.2020.103668 |
work_keys_str_mv | AT asplandemma modifiedneedlemanwunschalgorithmforclinicalpathwayclustering AT harperpaulr modifiedneedlemanwunschalgorithmforclinicalpathwayclustering AT gartnerdaniel modifiedneedlemanwunschalgorithmforclinicalpathwayclustering AT webbphilip modifiedneedlemanwunschalgorithmforclinicalpathwayclustering AT barrettleepeter modifiedneedlemanwunschalgorithmforclinicalpathwayclustering |