Cargando…
Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
BACKGROUND: Low diagnostic accuracy is a major concern in automated medical history–taking systems with differential diagnosis (DDx) generators. Extending the concept of collective intelligence to the field of DDx generators such that the accuracy of judgment becomes higher when accepting an integra...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10433017/ https://www.ncbi.nlm.nih.gov/pubmed/37531164 http://dx.doi.org/10.2196/49034 |
_version_ | 1785091556755111936 |
---|---|
author | Harada, Yukinori Tomiyama, Shusaku Sakamoto, Tetsu Sugimoto, Shu Kawamura, Ren Yokose, Masashi Hayashi, Arisa Shimizu, Taro |
author_facet | Harada, Yukinori Tomiyama, Shusaku Sakamoto, Tetsu Sugimoto, Shu Kawamura, Ren Yokose, Masashi Hayashi, Arisa Shimizu, Taro |
author_sort | Harada, Yukinori |
collection | PubMed |
description | BACKGROUND: Low diagnostic accuracy is a major concern in automated medical history–taking systems with differential diagnosis (DDx) generators. Extending the concept of collective intelligence to the field of DDx generators such that the accuracy of judgment becomes higher when accepting an integrated diagnosis list from multiple people than when accepting a diagnosis list from a single person may be a possible solution. OBJECTIVE: The purpose of this study is to assess whether the combined use of several DDx generators improves the diagnostic accuracy of DDx lists. METHODS: We used medical history data and the top 10 DDx lists (index DDx lists) generated by an artificial intelligence (AI)–driven automated medical history–taking system from 103 patients with confirmed diagnoses. Two research physicians independently created the other top 10 DDx lists (second and third DDx lists) per case by imputing key information into the other 2 DDx generators based on the medical history generated by the automated medical history–taking system without reading the index lists generated by the automated medical history–taking system. We used the McNemar test to assess the improvement in diagnostic accuracy from the index DDx lists to the three types of combined DDx lists: (1) simply combining DDx lists from the index, second, and third lists; (2) creating a new top 10 DDx list using a 1/n weighting rule; and (3) creating new lists with only shared diagnoses among DDx lists from the index, second, and third lists. We treated the data generated by 2 research physicians from the same patient as independent cases. Therefore, the number of cases included in analyses in the case using 2 additional lists was 206 (103 cases × 2 physicians’ input). RESULTS: The diagnostic accuracy of the index lists was 46% (47/103). Diagnostic accuracy was improved by simply combining the other 2 DDx lists (133/206, 65%, P<.001), whereas the other 2 combined DDx lists did not improve the diagnostic accuracy of the DDx lists (106/206, 52%, P=.05 in the collective list with the 1/n weighting rule and 29/206, 14%, P<.001 in the only shared diagnoses among the 3 DDx lists). CONCLUSIONS: Simply adding each of the top 10 DDx lists from additional DDx generators increased the diagnostic accuracy of the DDx list by approximately 20%, suggesting that the combinational use of DDx generators early in the diagnostic process is beneficial. |
format | Online Article Text |
id | pubmed-10433017 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-104330172023-08-18 Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study Harada, Yukinori Tomiyama, Shusaku Sakamoto, Tetsu Sugimoto, Shu Kawamura, Ren Yokose, Masashi Hayashi, Arisa Shimizu, Taro JMIR Form Res Original Paper BACKGROUND: Low diagnostic accuracy is a major concern in automated medical history–taking systems with differential diagnosis (DDx) generators. Extending the concept of collective intelligence to the field of DDx generators such that the accuracy of judgment becomes higher when accepting an integrated diagnosis list from multiple people than when accepting a diagnosis list from a single person may be a possible solution. OBJECTIVE: The purpose of this study is to assess whether the combined use of several DDx generators improves the diagnostic accuracy of DDx lists. METHODS: We used medical history data and the top 10 DDx lists (index DDx lists) generated by an artificial intelligence (AI)–driven automated medical history–taking system from 103 patients with confirmed diagnoses. Two research physicians independently created the other top 10 DDx lists (second and third DDx lists) per case by imputing key information into the other 2 DDx generators based on the medical history generated by the automated medical history–taking system without reading the index lists generated by the automated medical history–taking system. We used the McNemar test to assess the improvement in diagnostic accuracy from the index DDx lists to the three types of combined DDx lists: (1) simply combining DDx lists from the index, second, and third lists; (2) creating a new top 10 DDx list using a 1/n weighting rule; and (3) creating new lists with only shared diagnoses among DDx lists from the index, second, and third lists. We treated the data generated by 2 research physicians from the same patient as independent cases. Therefore, the number of cases included in analyses in the case using 2 additional lists was 206 (103 cases × 2 physicians’ input). RESULTS: The diagnostic accuracy of the index lists was 46% (47/103). Diagnostic accuracy was improved by simply combining the other 2 DDx lists (133/206, 65%, P<.001), whereas the other 2 combined DDx lists did not improve the diagnostic accuracy of the DDx lists (106/206, 52%, P=.05 in the collective list with the 1/n weighting rule and 29/206, 14%, P<.001 in the only shared diagnoses among the 3 DDx lists). CONCLUSIONS: Simply adding each of the top 10 DDx lists from additional DDx generators increased the diagnostic accuracy of the DDx list by approximately 20%, suggesting that the combinational use of DDx generators early in the diagnostic process is beneficial. JMIR Publications 2023-08-02 /pmc/articles/PMC10433017/ /pubmed/37531164 http://dx.doi.org/10.2196/49034 Text en ©Yukinori Harada, Shusaku Tomiyama, Tetsu Sakamoto, Shu Sugimoto, Ren Kawamura, Masashi Yokose, Arisa Hayashi, Taro Shimizu. Originally published in JMIR Formative Research (https://formative.jmir.org), 02.08.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Harada, Yukinori Tomiyama, Shusaku Sakamoto, Tetsu Sugimoto, Shu Kawamura, Ren Yokose, Masashi Hayashi, Arisa Shimizu, Taro Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study |
title | Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study |
title_full | Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study |
title_fullStr | Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study |
title_full_unstemmed | Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study |
title_short | Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study |
title_sort | effects of combinational use of additional differential diagnostic generators on the diagnostic accuracy of the differential diagnosis list developed by an artificial intelligence–driven automated history–taking system: pilot cross-sectional study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10433017/ https://www.ncbi.nlm.nih.gov/pubmed/37531164 http://dx.doi.org/10.2196/49034 |
work_keys_str_mv | AT haradayukinori effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT tomiyamashusaku effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT sakamototetsu effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT sugimotoshu effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT kawamuraren effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT yokosemasashi effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT hayashiarisa effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy AT shimizutaro effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy |