Cargando…

Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study

BACKGROUND: Low diagnostic accuracy is a major concern in automated medical history–taking systems with differential diagnosis (DDx) generators. Extending the concept of collective intelligence to the field of DDx generators such that the accuracy of judgment becomes higher when accepting an integra...

Descripción completa

Detalles Bibliográficos
Autores principales: Harada, Yukinori, Tomiyama, Shusaku, Sakamoto, Tetsu, Sugimoto, Shu, Kawamura, Ren, Yokose, Masashi, Hayashi, Arisa, Shimizu, Taro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10433017/
https://www.ncbi.nlm.nih.gov/pubmed/37531164
http://dx.doi.org/10.2196/49034
_version_ 1785091556755111936
author Harada, Yukinori
Tomiyama, Shusaku
Sakamoto, Tetsu
Sugimoto, Shu
Kawamura, Ren
Yokose, Masashi
Hayashi, Arisa
Shimizu, Taro
author_facet Harada, Yukinori
Tomiyama, Shusaku
Sakamoto, Tetsu
Sugimoto, Shu
Kawamura, Ren
Yokose, Masashi
Hayashi, Arisa
Shimizu, Taro
author_sort Harada, Yukinori
collection PubMed
description BACKGROUND: Low diagnostic accuracy is a major concern in automated medical history–taking systems with differential diagnosis (DDx) generators. Extending the concept of collective intelligence to the field of DDx generators such that the accuracy of judgment becomes higher when accepting an integrated diagnosis list from multiple people than when accepting a diagnosis list from a single person may be a possible solution. OBJECTIVE: The purpose of this study is to assess whether the combined use of several DDx generators improves the diagnostic accuracy of DDx lists. METHODS: We used medical history data and the top 10 DDx lists (index DDx lists) generated by an artificial intelligence (AI)–driven automated medical history–taking system from 103 patients with confirmed diagnoses. Two research physicians independently created the other top 10 DDx lists (second and third DDx lists) per case by imputing key information into the other 2 DDx generators based on the medical history generated by the automated medical history–taking system without reading the index lists generated by the automated medical history–taking system. We used the McNemar test to assess the improvement in diagnostic accuracy from the index DDx lists to the three types of combined DDx lists: (1) simply combining DDx lists from the index, second, and third lists; (2) creating a new top 10 DDx list using a 1/n weighting rule; and (3) creating new lists with only shared diagnoses among DDx lists from the index, second, and third lists. We treated the data generated by 2 research physicians from the same patient as independent cases. Therefore, the number of cases included in analyses in the case using 2 additional lists was 206 (103 cases × 2 physicians’ input). RESULTS: The diagnostic accuracy of the index lists was 46% (47/103). Diagnostic accuracy was improved by simply combining the other 2 DDx lists (133/206, 65%, P<.001), whereas the other 2 combined DDx lists did not improve the diagnostic accuracy of the DDx lists (106/206, 52%, P=.05 in the collective list with the 1/n weighting rule and 29/206, 14%, P<.001 in the only shared diagnoses among the 3 DDx lists). CONCLUSIONS: Simply adding each of the top 10 DDx lists from additional DDx generators increased the diagnostic accuracy of the DDx list by approximately 20%, suggesting that the combinational use of DDx generators early in the diagnostic process is beneficial.
format Online
Article
Text
id pubmed-10433017
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-104330172023-08-18 Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study Harada, Yukinori Tomiyama, Shusaku Sakamoto, Tetsu Sugimoto, Shu Kawamura, Ren Yokose, Masashi Hayashi, Arisa Shimizu, Taro JMIR Form Res Original Paper BACKGROUND: Low diagnostic accuracy is a major concern in automated medical history–taking systems with differential diagnosis (DDx) generators. Extending the concept of collective intelligence to the field of DDx generators such that the accuracy of judgment becomes higher when accepting an integrated diagnosis list from multiple people than when accepting a diagnosis list from a single person may be a possible solution. OBJECTIVE: The purpose of this study is to assess whether the combined use of several DDx generators improves the diagnostic accuracy of DDx lists. METHODS: We used medical history data and the top 10 DDx lists (index DDx lists) generated by an artificial intelligence (AI)–driven automated medical history–taking system from 103 patients with confirmed diagnoses. Two research physicians independently created the other top 10 DDx lists (second and third DDx lists) per case by imputing key information into the other 2 DDx generators based on the medical history generated by the automated medical history–taking system without reading the index lists generated by the automated medical history–taking system. We used the McNemar test to assess the improvement in diagnostic accuracy from the index DDx lists to the three types of combined DDx lists: (1) simply combining DDx lists from the index, second, and third lists; (2) creating a new top 10 DDx list using a 1/n weighting rule; and (3) creating new lists with only shared diagnoses among DDx lists from the index, second, and third lists. We treated the data generated by 2 research physicians from the same patient as independent cases. Therefore, the number of cases included in analyses in the case using 2 additional lists was 206 (103 cases × 2 physicians’ input). RESULTS: The diagnostic accuracy of the index lists was 46% (47/103). Diagnostic accuracy was improved by simply combining the other 2 DDx lists (133/206, 65%, P<.001), whereas the other 2 combined DDx lists did not improve the diagnostic accuracy of the DDx lists (106/206, 52%, P=.05 in the collective list with the 1/n weighting rule and 29/206, 14%, P<.001 in the only shared diagnoses among the 3 DDx lists). CONCLUSIONS: Simply adding each of the top 10 DDx lists from additional DDx generators increased the diagnostic accuracy of the DDx list by approximately 20%, suggesting that the combinational use of DDx generators early in the diagnostic process is beneficial. JMIR Publications 2023-08-02 /pmc/articles/PMC10433017/ /pubmed/37531164 http://dx.doi.org/10.2196/49034 Text en ©Yukinori Harada, Shusaku Tomiyama, Tetsu Sakamoto, Shu Sugimoto, Ren Kawamura, Masashi Yokose, Arisa Hayashi, Taro Shimizu. Originally published in JMIR Formative Research (https://formative.jmir.org), 02.08.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Harada, Yukinori
Tomiyama, Shusaku
Sakamoto, Tetsu
Sugimoto, Shu
Kawamura, Ren
Yokose, Masashi
Hayashi, Arisa
Shimizu, Taro
Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
title Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
title_full Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
title_fullStr Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
title_full_unstemmed Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
title_short Effects of Combinational Use of Additional Differential Diagnostic Generators on the Diagnostic Accuracy of the Differential Diagnosis List Developed by an Artificial Intelligence–Driven Automated History–Taking System: Pilot Cross-Sectional Study
title_sort effects of combinational use of additional differential diagnostic generators on the diagnostic accuracy of the differential diagnosis list developed by an artificial intelligence–driven automated history–taking system: pilot cross-sectional study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10433017/
https://www.ncbi.nlm.nih.gov/pubmed/37531164
http://dx.doi.org/10.2196/49034
work_keys_str_mv AT haradayukinori effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT tomiyamashusaku effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT sakamototetsu effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT sugimotoshu effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT kawamuraren effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT yokosemasashi effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT hayashiarisa effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy
AT shimizutaro effectsofcombinationaluseofadditionaldifferentialdiagnosticgeneratorsonthediagnosticaccuracyofthedifferentialdiagnosislistdevelopedbyanartificialintelligencedrivenautomatedhistorytakingsystempilotcrosssectionalstudy