Cargando…

What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study

OBJECTIVE: Assess the suitability of clinical vignettes in benchmarking the performance of online symptom checkers (OSCs). DESIGN: Observational study using a publicly available free OSC. PARTICIPANTS: Healthily OSC, which provided consultations in English, was used to record consultation outcomes f...

Descripción completa

Detalles Bibliográficos
Autores principales: El-Osta, Austen, Webber, Iman, Alaa, Aos, Bagkeris, Emmanouil, Mian, Saba, Taghavi Azar Sharabiani, Mansour, Majeed, Azeem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047920/
https://www.ncbi.nlm.nih.gov/pubmed/35477872
http://dx.doi.org/10.1136/bmjopen-2021-053566
_version_ 1784695829305491456
author El-Osta, Austen
Webber, Iman
Alaa, Aos
Bagkeris, Emmanouil
Mian, Saba
Taghavi Azar Sharabiani, Mansour
Majeed, Azeem
author_facet El-Osta, Austen
Webber, Iman
Alaa, Aos
Bagkeris, Emmanouil
Mian, Saba
Taghavi Azar Sharabiani, Mansour
Majeed, Azeem
author_sort El-Osta, Austen
collection PubMed
description OBJECTIVE: Assess the suitability of clinical vignettes in benchmarking the performance of online symptom checkers (OSCs). DESIGN: Observational study using a publicly available free OSC. PARTICIPANTS: Healthily OSC, which provided consultations in English, was used to record consultation outcomes from two lay and four expert inputters using 139 standardised patient vignettes. Each vignette included three diagnostic solutions and a triage recommendation in one of three categories of triage urgency. A panel of three independent general practitioners interpreted the vignettes to arrive at an alternative set of diagnostic and triage solutions. Both sets of diagnostic and triage solutions were consolidated to arrive at a final consolidated version for benchmarking. MAIN OUTCOME MEASURES: Six inputters simulated 834 standardised patient evaluations using Healthily OSC and recorded outputs (triage solution, signposting, and whether the correct diagnostic solution appeared first or within the first three differentials). We estimated Cohen’s kappa to assess how interpretations by different inputters could lead to divergent OSC output even when using the same vignette or when compared with a separate panel of physicians. RESULTS: There was moderate agreement on triage recommendation (kappa=0.48), and substantial agreement on consultation outcomes between all inputters (kappa=0.73). OSC performance improved significantly from baseline when compared against the final consolidated diagnostic and triage solution (p<0.001). CONCLUSIONS: Clinical vignettes are inherently limited in their utility to benchmark the diagnostic accuracy or triage safety of OSC. Real-world evidence studies involving real patients are recommended to benchmark the performance of OSC against a panel of physicians.
format Online
Article
Text
id pubmed-9047920
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-90479202022-05-11 What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study El-Osta, Austen Webber, Iman Alaa, Aos Bagkeris, Emmanouil Mian, Saba Taghavi Azar Sharabiani, Mansour Majeed, Azeem BMJ Open General practice / Family practice OBJECTIVE: Assess the suitability of clinical vignettes in benchmarking the performance of online symptom checkers (OSCs). DESIGN: Observational study using a publicly available free OSC. PARTICIPANTS: Healthily OSC, which provided consultations in English, was used to record consultation outcomes from two lay and four expert inputters using 139 standardised patient vignettes. Each vignette included three diagnostic solutions and a triage recommendation in one of three categories of triage urgency. A panel of three independent general practitioners interpreted the vignettes to arrive at an alternative set of diagnostic and triage solutions. Both sets of diagnostic and triage solutions were consolidated to arrive at a final consolidated version for benchmarking. MAIN OUTCOME MEASURES: Six inputters simulated 834 standardised patient evaluations using Healthily OSC and recorded outputs (triage solution, signposting, and whether the correct diagnostic solution appeared first or within the first three differentials). We estimated Cohen’s kappa to assess how interpretations by different inputters could lead to divergent OSC output even when using the same vignette or when compared with a separate panel of physicians. RESULTS: There was moderate agreement on triage recommendation (kappa=0.48), and substantial agreement on consultation outcomes between all inputters (kappa=0.73). OSC performance improved significantly from baseline when compared against the final consolidated diagnostic and triage solution (p<0.001). CONCLUSIONS: Clinical vignettes are inherently limited in their utility to benchmark the diagnostic accuracy or triage safety of OSC. Real-world evidence studies involving real patients are recommended to benchmark the performance of OSC against a panel of physicians. BMJ Publishing Group 2022-04-27 /pmc/articles/PMC9047920/ /pubmed/35477872 http://dx.doi.org/10.1136/bmjopen-2021-053566 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle General practice / Family practice
El-Osta, Austen
Webber, Iman
Alaa, Aos
Bagkeris, Emmanouil
Mian, Saba
Taghavi Azar Sharabiani, Mansour
Majeed, Azeem
What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study
title What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study
title_full What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study
title_fullStr What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study
title_full_unstemmed What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study
title_short What is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? An audit study
title_sort what is the suitability of clinical vignettes in benchmarking the performance of online symptom checkers? an audit study
topic General practice / Family practice
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9047920/
https://www.ncbi.nlm.nih.gov/pubmed/35477872
http://dx.doi.org/10.1136/bmjopen-2021-053566
work_keys_str_mv AT elostaausten whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy
AT webberiman whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy
AT alaaaos whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy
AT bagkerisemmanouil whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy
AT miansaba whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy
AT taghaviazarsharabianimansour whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy
AT majeedazeem whatisthesuitabilityofclinicalvignettesinbenchmarkingtheperformanceofonlinesymptomcheckersanauditstudy