Cargando…
Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
BACKGROUND: Medical decision support systems (CDSSs) are increasingly used in medicine, but their utility in daily medical practice is difficult to evaluate. One variant of CDSS is a generator of differential diagnoses (DDx generator). We performed a feasibility study on three different, publicly av...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9509605/ https://www.ncbi.nlm.nih.gov/pubmed/36153527 http://dx.doi.org/10.1186/s12911-022-01988-2 |
_version_ | 1784797263119253504 |
---|---|
author | Fritz, P. Kleinhans, A. Raoufi, R. Sediqi, A. Schmid, N. Schricker, S. Schanz, M. Fritz-Kuisle, C. Dalquen, P. Firooz, H. Stauch, G. Alscher, M. D. |
author_facet | Fritz, P. Kleinhans, A. Raoufi, R. Sediqi, A. Schmid, N. Schricker, S. Schanz, M. Fritz-Kuisle, C. Dalquen, P. Firooz, H. Stauch, G. Alscher, M. D. |
author_sort | Fritz, P. |
collection | PubMed |
description | BACKGROUND: Medical decision support systems (CDSSs) are increasingly used in medicine, but their utility in daily medical practice is difficult to evaluate. One variant of CDSS is a generator of differential diagnoses (DDx generator). We performed a feasibility study on three different, publicly available data sets of medical cases in order to identify the frequency in which two different DDx generators provide helpful information (either by providing a list of differential diagnosis or recognizing the expert diagnosis if available) for a given case report. METHODS: Used data sets were n = 105 cases from a web-based forum of telemedicine with real life cases from Afghanistan (Afghan data set; AD), n = 124 cases discussed in a web-based medical forum (Coliquio data set; CD). Both websites are restricted for medical professionals only. The third data set consisted 50 special case reports published in the New England Journal of Medicine (NEJM). After keyword extraction, data were entered into two different DDx generators (IsabelHealth (IH), Memem7 (M7)) to examine differences in target diagnosis recognition and physician-rated usefulness between DDx generators. RESULTS: Both DDx generators detected the target diagnosis equally successfully (all cases: M7, 83/170 (49%); IH 90/170 (53%), NEJM: M7, 28/50 (56%); IH, 34/50 (68%); differences n.s.). Differences occurred in AD, where detection of an expert diagnosis was less successful with IH than with M7 (29.7% vs. 54.1%, p = 0.003). In contrast, in CD IH performed significantly better than M7 (73.9% vs. 32.6%, p = 0.021). Congruent identification of target diagnosis occurred in only 46/170 (27.1%) of cases. However, a qualitative analysis of the DDx results revealed useful complements from using the two systems in parallel. CONCLUSION: Both DDx systems IsabelHealth and Memem7 provided substantial help in finding a helpful list of differential diagnoses or identifying the target diagnosis either in standard cases or complicated and rare cases. Our pilot study highlights the need for different levels of complexity and types of real-world medical test cases, as there are significant differences between DDx generators away from traditional case reports. Combining different results from DDx generators seems to be a possible approach for future review and use of the systems. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01988-2. |
format | Online Article Text |
id | pubmed-9509605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-95096052022-09-26 Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin Fritz, P. Kleinhans, A. Raoufi, R. Sediqi, A. Schmid, N. Schricker, S. Schanz, M. Fritz-Kuisle, C. Dalquen, P. Firooz, H. Stauch, G. Alscher, M. D. BMC Med Inform Decis Mak Research BACKGROUND: Medical decision support systems (CDSSs) are increasingly used in medicine, but their utility in daily medical practice is difficult to evaluate. One variant of CDSS is a generator of differential diagnoses (DDx generator). We performed a feasibility study on three different, publicly available data sets of medical cases in order to identify the frequency in which two different DDx generators provide helpful information (either by providing a list of differential diagnosis or recognizing the expert diagnosis if available) for a given case report. METHODS: Used data sets were n = 105 cases from a web-based forum of telemedicine with real life cases from Afghanistan (Afghan data set; AD), n = 124 cases discussed in a web-based medical forum (Coliquio data set; CD). Both websites are restricted for medical professionals only. The third data set consisted 50 special case reports published in the New England Journal of Medicine (NEJM). After keyword extraction, data were entered into two different DDx generators (IsabelHealth (IH), Memem7 (M7)) to examine differences in target diagnosis recognition and physician-rated usefulness between DDx generators. RESULTS: Both DDx generators detected the target diagnosis equally successfully (all cases: M7, 83/170 (49%); IH 90/170 (53%), NEJM: M7, 28/50 (56%); IH, 34/50 (68%); differences n.s.). Differences occurred in AD, where detection of an expert diagnosis was less successful with IH than with M7 (29.7% vs. 54.1%, p = 0.003). In contrast, in CD IH performed significantly better than M7 (73.9% vs. 32.6%, p = 0.021). Congruent identification of target diagnosis occurred in only 46/170 (27.1%) of cases. However, a qualitative analysis of the DDx results revealed useful complements from using the two systems in parallel. CONCLUSION: Both DDx systems IsabelHealth and Memem7 provided substantial help in finding a helpful list of differential diagnoses or identifying the target diagnosis either in standard cases or complicated and rare cases. Our pilot study highlights the need for different levels of complexity and types of real-world medical test cases, as there are significant differences between DDx generators away from traditional case reports. Combining different results from DDx generators seems to be a possible approach for future review and use of the systems. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01988-2. BioMed Central 2022-09-24 /pmc/articles/PMC9509605/ /pubmed/36153527 http://dx.doi.org/10.1186/s12911-022-01988-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Fritz, P. Kleinhans, A. Raoufi, R. Sediqi, A. Schmid, N. Schricker, S. Schanz, M. Fritz-Kuisle, C. Dalquen, P. Firooz, H. Stauch, G. Alscher, M. D. Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin |
title | Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin |
title_full | Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin |
title_fullStr | Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin |
title_full_unstemmed | Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin |
title_short | Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin |
title_sort | evaluation of medical decision support systems (ddx generators) using real medical cases of varying complexity and origin |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9509605/ https://www.ncbi.nlm.nih.gov/pubmed/36153527 http://dx.doi.org/10.1186/s12911-022-01988-2 |
work_keys_str_mv | AT fritzp evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT kleinhansa evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT raoufir evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT sediqia evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT schmidn evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT schrickers evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT schanzm evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT fritzkuislec evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT dalquenp evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT firoozh evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT stauchg evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin AT alschermd evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin |