Cargando…

Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin

BACKGROUND: Medical decision support systems (CDSSs) are increasingly used in medicine, but their utility in daily medical practice is difficult to evaluate. One variant of CDSS is a generator of differential diagnoses (DDx generator). We performed a feasibility study on three different, publicly av...

Descripción completa

Detalles Bibliográficos
Autores principales: Fritz, P., Kleinhans, A., Raoufi, R., Sediqi, A., Schmid, N., Schricker, S., Schanz, M., Fritz-Kuisle, C., Dalquen, P., Firooz, H., Stauch, G., Alscher, M. D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9509605/
https://www.ncbi.nlm.nih.gov/pubmed/36153527
http://dx.doi.org/10.1186/s12911-022-01988-2
_version_ 1784797263119253504
author Fritz, P.
Kleinhans, A.
Raoufi, R.
Sediqi, A.
Schmid, N.
Schricker, S.
Schanz, M.
Fritz-Kuisle, C.
Dalquen, P.
Firooz, H.
Stauch, G.
Alscher, M. D.
author_facet Fritz, P.
Kleinhans, A.
Raoufi, R.
Sediqi, A.
Schmid, N.
Schricker, S.
Schanz, M.
Fritz-Kuisle, C.
Dalquen, P.
Firooz, H.
Stauch, G.
Alscher, M. D.
author_sort Fritz, P.
collection PubMed
description BACKGROUND: Medical decision support systems (CDSSs) are increasingly used in medicine, but their utility in daily medical practice is difficult to evaluate. One variant of CDSS is a generator of differential diagnoses (DDx generator). We performed a feasibility study on three different, publicly available data sets of medical cases in order to identify the frequency in which two different DDx generators provide helpful information (either by providing a list of differential diagnosis or recognizing the expert diagnosis if available) for a given case report. METHODS: Used data sets were n = 105 cases from a web-based forum of telemedicine with real life cases from Afghanistan (Afghan data set; AD), n = 124 cases discussed in a web-based medical forum (Coliquio data set; CD). Both websites are restricted for medical professionals only. The third data set consisted 50 special case reports published in the New England Journal of Medicine (NEJM). After keyword extraction, data were entered into two different DDx generators (IsabelHealth (IH), Memem7 (M7)) to examine differences in target diagnosis recognition and physician-rated usefulness between DDx generators. RESULTS: Both DDx generators detected the target diagnosis equally successfully (all cases: M7, 83/170 (49%); IH 90/170 (53%), NEJM: M7, 28/50 (56%); IH, 34/50 (68%); differences n.s.). Differences occurred in AD, where detection of an expert diagnosis was less successful with IH than with M7 (29.7% vs. 54.1%, p = 0.003). In contrast, in CD IH performed significantly better than M7 (73.9% vs. 32.6%, p = 0.021). Congruent identification of target diagnosis occurred in only 46/170 (27.1%) of cases. However, a qualitative analysis of the DDx results revealed useful complements from using the two systems in parallel. CONCLUSION: Both DDx systems IsabelHealth and Memem7 provided substantial help in finding a helpful list of differential diagnoses or identifying the target diagnosis either in standard cases or complicated and rare cases. Our pilot study highlights the need for different levels of complexity and types of real-world medical test cases, as there are significant differences between DDx generators away from traditional case reports. Combining different results from DDx generators seems to be a possible approach for future review and use of the systems. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01988-2.
format Online
Article
Text
id pubmed-9509605
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-95096052022-09-26 Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin Fritz, P. Kleinhans, A. Raoufi, R. Sediqi, A. Schmid, N. Schricker, S. Schanz, M. Fritz-Kuisle, C. Dalquen, P. Firooz, H. Stauch, G. Alscher, M. D. BMC Med Inform Decis Mak Research BACKGROUND: Medical decision support systems (CDSSs) are increasingly used in medicine, but their utility in daily medical practice is difficult to evaluate. One variant of CDSS is a generator of differential diagnoses (DDx generator). We performed a feasibility study on three different, publicly available data sets of medical cases in order to identify the frequency in which two different DDx generators provide helpful information (either by providing a list of differential diagnosis or recognizing the expert diagnosis if available) for a given case report. METHODS: Used data sets were n = 105 cases from a web-based forum of telemedicine with real life cases from Afghanistan (Afghan data set; AD), n = 124 cases discussed in a web-based medical forum (Coliquio data set; CD). Both websites are restricted for medical professionals only. The third data set consisted 50 special case reports published in the New England Journal of Medicine (NEJM). After keyword extraction, data were entered into two different DDx generators (IsabelHealth (IH), Memem7 (M7)) to examine differences in target diagnosis recognition and physician-rated usefulness between DDx generators. RESULTS: Both DDx generators detected the target diagnosis equally successfully (all cases: M7, 83/170 (49%); IH 90/170 (53%), NEJM: M7, 28/50 (56%); IH, 34/50 (68%); differences n.s.). Differences occurred in AD, where detection of an expert diagnosis was less successful with IH than with M7 (29.7% vs. 54.1%, p = 0.003). In contrast, in CD IH performed significantly better than M7 (73.9% vs. 32.6%, p = 0.021). Congruent identification of target diagnosis occurred in only 46/170 (27.1%) of cases. However, a qualitative analysis of the DDx results revealed useful complements from using the two systems in parallel. CONCLUSION: Both DDx systems IsabelHealth and Memem7 provided substantial help in finding a helpful list of differential diagnoses or identifying the target diagnosis either in standard cases or complicated and rare cases. Our pilot study highlights the need for different levels of complexity and types of real-world medical test cases, as there are significant differences between DDx generators away from traditional case reports. Combining different results from DDx generators seems to be a possible approach for future review and use of the systems. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-022-01988-2. BioMed Central 2022-09-24 /pmc/articles/PMC9509605/ /pubmed/36153527 http://dx.doi.org/10.1186/s12911-022-01988-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Fritz, P.
Kleinhans, A.
Raoufi, R.
Sediqi, A.
Schmid, N.
Schricker, S.
Schanz, M.
Fritz-Kuisle, C.
Dalquen, P.
Firooz, H.
Stauch, G.
Alscher, M. D.
Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
title Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
title_full Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
title_fullStr Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
title_full_unstemmed Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
title_short Evaluation of medical decision support systems (DDX generators) using real medical cases of varying complexity and origin
title_sort evaluation of medical decision support systems (ddx generators) using real medical cases of varying complexity and origin
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9509605/
https://www.ncbi.nlm.nih.gov/pubmed/36153527
http://dx.doi.org/10.1186/s12911-022-01988-2
work_keys_str_mv AT fritzp evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT kleinhansa evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT raoufir evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT sediqia evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT schmidn evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT schrickers evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT schanzm evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT fritzkuislec evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT dalquenp evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT firoozh evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT stauchg evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin
AT alschermd evaluationofmedicaldecisionsupportsystemsddxgeneratorsusingrealmedicalcasesofvaryingcomplexityandorigin