Cargando…
Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study
BACKGROUND: The burden on healthcare systems is mounting continuously owing to population growth and aging, overuse of medical services, and the recent COVID-19 pandemic. This overload is also causing reduced healthcare quality and outcomes. One solution gaining momentum is the integration of intell...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Author(s). Published by Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9595333/ https://www.ncbi.nlm.nih.gov/pubmed/36306653 http://dx.doi.org/10.1016/j.ijmedinf.2022.104897 |
_version_ | 1784815625111076864 |
---|---|
author | Ben-Shabat, Niv Sharvit, Gal Meimis, Ben Ben Joya, Daniel Sloma, Ariel Kiderman, David Shabat, Aviv Tsur, Avishai M Watad, Abdulla Amital, Howard |
author_facet | Ben-Shabat, Niv Sharvit, Gal Meimis, Ben Ben Joya, Daniel Sloma, Ariel Kiderman, David Shabat, Aviv Tsur, Avishai M Watad, Abdulla Amital, Howard |
author_sort | Ben-Shabat, Niv |
collection | PubMed |
description | BACKGROUND: The burden on healthcare systems is mounting continuously owing to population growth and aging, overuse of medical services, and the recent COVID-19 pandemic. This overload is also causing reduced healthcare quality and outcomes. One solution gaining momentum is the integration of intelligent self-assessment tools, known as symptom-checkers, into healthcare-providers’ systems. To the best of our knowledge, no study so far has investigated the data-gathering capabilities of these tools, which represent a crucial resource for simulating doctors’ skills in medical-interviews. OBJECTIVES: The goal of this study was to evaluate the data-gathering function of currently available chatbot symptom-checkers. METHODS: We evaluated 8 symptom-checkers using 28 clinical vignettes from the repository of MSD-Manual case studies. The mean number of predefined pertinent findings for each case was 31.8 ± 6.8. The vignettes were entered into the platforms by 3 medical students who simulated the role of the patient. For each conversation, we obtained the number of pertinent findings retrieved and the number of questions asked. We then calculated the recall-rates (pertinent-findings retrieved out of all predefined pertinent-findings), and efficiency-rates (pertinent-findings retrieved out of the number of questions asked) of data-gathering, and compared them between the platforms. RESULTS: The overall recall rate for all symptom-checkers was 0.32(2,280/7,112;95 %CI 0.31–0.33) for all pertinent findings, 0.37(1,110/2,992;95 %CI 0.35–0.39) for present findings, and 0.28(1140/4120;95 %CI 0.26–0.29) for absent findings. Among the symptom-checkers, Kahun platform had the highest recall rate with 0.51(450/889;95 %CI 0.47–0.54). Out of 4,877 questions asked overall, 2,280 findings were gathered, yielding an efficiency rate of 0.46(95 %CI 0.45–0.48) across all platforms. Kahun was the most efficient tool 0.74 (95 %CI 0.70–0.77) without a statistically significant difference from Your.MD 0.69(95 %CI 0.65–0.73). CONCLUSION: The data-gathering performance of currently available symptom checkers is questionable. From among the tools available, Kahun demonstrated the best overall performance. |
format | Online Article Text |
id | pubmed-9595333 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Author(s). Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95953332022-10-25 Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study Ben-Shabat, Niv Sharvit, Gal Meimis, Ben Ben Joya, Daniel Sloma, Ariel Kiderman, David Shabat, Aviv Tsur, Avishai M Watad, Abdulla Amital, Howard Int J Med Inform Article BACKGROUND: The burden on healthcare systems is mounting continuously owing to population growth and aging, overuse of medical services, and the recent COVID-19 pandemic. This overload is also causing reduced healthcare quality and outcomes. One solution gaining momentum is the integration of intelligent self-assessment tools, known as symptom-checkers, into healthcare-providers’ systems. To the best of our knowledge, no study so far has investigated the data-gathering capabilities of these tools, which represent a crucial resource for simulating doctors’ skills in medical-interviews. OBJECTIVES: The goal of this study was to evaluate the data-gathering function of currently available chatbot symptom-checkers. METHODS: We evaluated 8 symptom-checkers using 28 clinical vignettes from the repository of MSD-Manual case studies. The mean number of predefined pertinent findings for each case was 31.8 ± 6.8. The vignettes were entered into the platforms by 3 medical students who simulated the role of the patient. For each conversation, we obtained the number of pertinent findings retrieved and the number of questions asked. We then calculated the recall-rates (pertinent-findings retrieved out of all predefined pertinent-findings), and efficiency-rates (pertinent-findings retrieved out of the number of questions asked) of data-gathering, and compared them between the platforms. RESULTS: The overall recall rate for all symptom-checkers was 0.32(2,280/7,112;95 %CI 0.31–0.33) for all pertinent findings, 0.37(1,110/2,992;95 %CI 0.35–0.39) for present findings, and 0.28(1140/4120;95 %CI 0.26–0.29) for absent findings. Among the symptom-checkers, Kahun platform had the highest recall rate with 0.51(450/889;95 %CI 0.47–0.54). Out of 4,877 questions asked overall, 2,280 findings were gathered, yielding an efficiency rate of 0.46(95 %CI 0.45–0.48) across all platforms. Kahun was the most efficient tool 0.74 (95 %CI 0.70–0.77) without a statistically significant difference from Your.MD 0.69(95 %CI 0.65–0.73). CONCLUSION: The data-gathering performance of currently available symptom checkers is questionable. From among the tools available, Kahun demonstrated the best overall performance. The Author(s). Published by Elsevier B.V. 2022-12 2022-10-22 /pmc/articles/PMC9595333/ /pubmed/36306653 http://dx.doi.org/10.1016/j.ijmedinf.2022.104897 Text en © 2022 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Ben-Shabat, Niv Sharvit, Gal Meimis, Ben Ben Joya, Daniel Sloma, Ariel Kiderman, David Shabat, Aviv Tsur, Avishai M Watad, Abdulla Amital, Howard Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
title | Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
title_full | Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
title_fullStr | Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
title_full_unstemmed | Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
title_short | Assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
title_sort | assessing data gathering of chatbot based symptom checkers - a clinical vignettes study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9595333/ https://www.ncbi.nlm.nih.gov/pubmed/36306653 http://dx.doi.org/10.1016/j.ijmedinf.2022.104897 |
work_keys_str_mv | AT benshabatniv assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT sharvitgal assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT meimisben assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT benjoyadaniel assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT slomaariel assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT kidermandavid assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT shabataviv assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT tsuravishaim assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT watadabdulla assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy AT amitalhoward assessingdatagatheringofchatbotbasedsymptomcheckersaclinicalvignettesstudy |