Cargando…

Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review

IMPORTANCE: An increasing number of machine learning (ML)–based clinical decision support systems (CDSSs) are described in the medical literature, but this research focuses almost entirely on comparing CDSS directly with clinicians (human vs computer). Little is known about the outcomes of these sys...

Descripción completa

Detalles Bibliográficos
Autores principales:	Vasey, Baptiste, Ursprung, Stephan, Beddoe, Benjamin, Taylor, Elliott H., Marlow, Neale, Bilbro, Nicole, Watkinson, Peter, McCulloch, Peter
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Medical Association 2021
Materias:	Original Investigation
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7953308/ https://www.ncbi.nlm.nih.gov/pubmed/33704476 http://dx.doi.org/10.1001/jamanetworkopen.2021.1276

_version_	1783663892622213120
author	Vasey, Baptiste Ursprung, Stephan Beddoe, Benjamin Taylor, Elliott H. Marlow, Neale Bilbro, Nicole Watkinson, Peter McCulloch, Peter
author_facet	Vasey, Baptiste Ursprung, Stephan Beddoe, Benjamin Taylor, Elliott H. Marlow, Neale Bilbro, Nicole Watkinson, Peter McCulloch, Peter
author_sort	Vasey, Baptiste
collection	PubMed
description	IMPORTANCE: An increasing number of machine learning (ML)–based clinical decision support systems (CDSSs) are described in the medical literature, but this research focuses almost entirely on comparing CDSS directly with clinicians (human vs computer). Little is known about the outcomes of these systems when used as adjuncts to human decision-making (human vs human with computer). OBJECTIVES: To conduct a systematic review to investigate the association between the interactive use of ML-based diagnostic CDSSs and clinician performance and to examine the extent of the CDSSs’ human factors evaluation. EVIDENCE REVIEW: A search of MEDLINE, Embase, PsycINFO, and grey literature was conducted for the period between January 1, 2010, and May 31, 2019. Peer-reviewed studies published in English comparing human clinician performance with and without interactive use of an ML-based diagnostic CDSSs were included. All metrics used to assess human performance were considered as outcomes. The risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) and Risk of Bias in Non-Randomised Studies–Intervention (ROBINS-I). Narrative summaries were produced for the main outcomes. Given the heterogeneity of medical conditions, outcomes of interest, and evaluation metrics, no meta-analysis was performed. FINDINGS: A total of 8112 studies were initially retrieved and 5154 abstracts were screened; of these, 37 studies met the inclusion criteria. The median number of participating clinicians was 4 (interquartile range, 3-8). Of the 107 results that reported statistical significance, 54 (50%) were increased by the use of CDSSs, 4 (4%) were decreased, and 49 (46%) showed no change or an unclear change. In the subgroup of studies carried out in representative clinical settings, no association between the use of ML-based diagnostic CDSSs and improved clinician performance could be observed. Interobserver agreement was the commonly reported outcome whose change was the most strongly associated with CDSS use. Four studies (11%) reported on user feedback, and, in all but 1 case, clinicians decided to override at least some of the algorithms’ recommendations. Twenty-eight studies (76%) were rated as having a high risk of bias in at least 1 of the 4 QUADAS-2 core domains, and 6 studies (16%) were considered to be at serious or critical risk of bias using ROBINS-I. CONCLUSIONS AND RELEVANCE: This systematic review found only sparse evidence that the use of ML-based CDSSs is associated with improved clinician diagnostic performance. Most studies had a low number of participants, were at high or unclear risk of bias, and showed little or no consideration for human factors. Caution should be exercised when estimating the current potential of ML to improve human diagnostic performance, and more comprehensive evaluation should be conducted before deploying ML-based CDSSs in clinical settings. The results highlight the importance of considering supported human decisions as end points rather than merely the stand-alone CDSSs outputs.
format	Online Article Text
id	pubmed-7953308
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	American Medical Association
record_format	MEDLINE/PubMed
spelling	pubmed-79533082021-03-28 Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review Vasey, Baptiste Ursprung, Stephan Beddoe, Benjamin Taylor, Elliott H. Marlow, Neale Bilbro, Nicole Watkinson, Peter McCulloch, Peter JAMA Netw Open Original Investigation IMPORTANCE: An increasing number of machine learning (ML)–based clinical decision support systems (CDSSs) are described in the medical literature, but this research focuses almost entirely on comparing CDSS directly with clinicians (human vs computer). Little is known about the outcomes of these systems when used as adjuncts to human decision-making (human vs human with computer). OBJECTIVES: To conduct a systematic review to investigate the association between the interactive use of ML-based diagnostic CDSSs and clinician performance and to examine the extent of the CDSSs’ human factors evaluation. EVIDENCE REVIEW: A search of MEDLINE, Embase, PsycINFO, and grey literature was conducted for the period between January 1, 2010, and May 31, 2019. Peer-reviewed studies published in English comparing human clinician performance with and without interactive use of an ML-based diagnostic CDSSs were included. All metrics used to assess human performance were considered as outcomes. The risk of bias was assessed using Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) and Risk of Bias in Non-Randomised Studies–Intervention (ROBINS-I). Narrative summaries were produced for the main outcomes. Given the heterogeneity of medical conditions, outcomes of interest, and evaluation metrics, no meta-analysis was performed. FINDINGS: A total of 8112 studies were initially retrieved and 5154 abstracts were screened; of these, 37 studies met the inclusion criteria. The median number of participating clinicians was 4 (interquartile range, 3-8). Of the 107 results that reported statistical significance, 54 (50%) were increased by the use of CDSSs, 4 (4%) were decreased, and 49 (46%) showed no change or an unclear change. In the subgroup of studies carried out in representative clinical settings, no association between the use of ML-based diagnostic CDSSs and improved clinician performance could be observed. Interobserver agreement was the commonly reported outcome whose change was the most strongly associated with CDSS use. Four studies (11%) reported on user feedback, and, in all but 1 case, clinicians decided to override at least some of the algorithms’ recommendations. Twenty-eight studies (76%) were rated as having a high risk of bias in at least 1 of the 4 QUADAS-2 core domains, and 6 studies (16%) were considered to be at serious or critical risk of bias using ROBINS-I. CONCLUSIONS AND RELEVANCE: This systematic review found only sparse evidence that the use of ML-based CDSSs is associated with improved clinician diagnostic performance. Most studies had a low number of participants, were at high or unclear risk of bias, and showed little or no consideration for human factors. Caution should be exercised when estimating the current potential of ML to improve human diagnostic performance, and more comprehensive evaluation should be conducted before deploying ML-based CDSSs in clinical settings. The results highlight the importance of considering supported human decisions as end points rather than merely the stand-alone CDSSs outputs. American Medical Association 2021-03-11 /pmc/articles/PMC7953308/ /pubmed/33704476 http://dx.doi.org/10.1001/jamanetworkopen.2021.1276 Text en Copyright 2021 Vasey B et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License.
spellingShingle	Original Investigation Vasey, Baptiste Ursprung, Stephan Beddoe, Benjamin Taylor, Elliott H. Marlow, Neale Bilbro, Nicole Watkinson, Peter McCulloch, Peter Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review
title	Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review
title_full	Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review
title_fullStr	Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review
title_full_unstemmed	Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review
title_short	Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review
title_sort	association of clinician diagnostic performance with machine learning–based decision support systems: a systematic review
topic	Original Investigation
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7953308/ https://www.ncbi.nlm.nih.gov/pubmed/33704476 http://dx.doi.org/10.1001/jamanetworkopen.2021.1276
work_keys_str_mv	AT vaseybaptiste associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT ursprungstephan associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT beddoebenjamin associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT taylorelliotth associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT marlowneale associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT bilbronicole associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT watkinsonpeter associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview AT mccullochpeter associationofcliniciandiagnosticperformancewithmachinelearningbaseddecisionsupportsystemsasystematicreview

Association of Clinician Diagnostic Performance With Machine Learning–Based Decision Support Systems: A Systematic Review

Ejemplares similares