Cargando…

Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications

Radiologists today play a central role in making diagnostic decisions and labeling images for training and benchmarking artificial intelligence (AI) algorithms. A key concern is low inter-reader reliability (IRR) seen between experts when interpreting challenging cases. While team-based decisions ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Shah, Rutwik, Astuto Arouche Nunes, Bruno, Gleason, Tyler, Fletcher, Will, Banaga, Justin, Sweetwood, Kevin, Ye, Allen, Patel, Rina, McGill, Kevin, Link, Thomas, Crane, Jason, Pedoia, Valentina, Majumdar, Sharmila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039189/
https://www.ncbi.nlm.nih.gov/pubmed/36414832
http://dx.doi.org/10.1007/s10278-022-00662-3
_version_ 1784912219224408064
author Shah, Rutwik
Astuto Arouche Nunes, Bruno
Gleason, Tyler
Fletcher, Will
Banaga, Justin
Sweetwood, Kevin
Ye, Allen
Patel, Rina
McGill, Kevin
Link, Thomas
Crane, Jason
Pedoia, Valentina
Majumdar, Sharmila
author_facet Shah, Rutwik
Astuto Arouche Nunes, Bruno
Gleason, Tyler
Fletcher, Will
Banaga, Justin
Sweetwood, Kevin
Ye, Allen
Patel, Rina
McGill, Kevin
Link, Thomas
Crane, Jason
Pedoia, Valentina
Majumdar, Sharmila
author_sort Shah, Rutwik
collection PubMed
description Radiologists today play a central role in making diagnostic decisions and labeling images for training and benchmarking artificial intelligence (AI) algorithms. A key concern is low inter-reader reliability (IRR) seen between experts when interpreting challenging cases. While team-based decisions are known to outperform individual decisions, inter-personal biases often creep up in group interactions which limit nondominant participants from expressing true opinions. To overcome the dual problems of low consensus and interpersonal bias, we explored a solution modeled on bee swarms. Two separate cohorts, three board-certified radiologists, (cohort 1), and five radiology residents (cohort 2) collaborated on a digital swarm platform in real time and in a blinded fashion, grading meniscal lesions on knee MR exams. These consensus votes were benchmarked against clinical (arthroscopy) and radiological (senior-most radiologist) standards of reference using Cohen’s kappa. The IRR of the consensus votes was then compared to the IRR of the majority and most confident votes of the two cohorts. IRR was also calculated for predictions from a meniscal lesion detecting AI algorithm. The attending cohort saw an improvement of 23% in IRR of swarm votes (k = 0.34) over majority vote (k = 0.11). Similar improvement of 23% in IRR (k = 0.25) in 3-resident swarm votes over majority vote (k = 0.02) was observed. The 5-resident swarm had an even higher improvement of 30% in IRR (k = 0.37) over majority vote (k = 0.07). The swarm consensus votes outperformed individual and majority vote decision in both the radiologists and resident cohorts. The attending and resident swarms also outperformed predictions from a state-of-the-art AI algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10278-022-00662-3.
format Online
Article
Text
id pubmed-10039189
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-100391892023-03-26 Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications Shah, Rutwik Astuto Arouche Nunes, Bruno Gleason, Tyler Fletcher, Will Banaga, Justin Sweetwood, Kevin Ye, Allen Patel, Rina McGill, Kevin Link, Thomas Crane, Jason Pedoia, Valentina Majumdar, Sharmila J Digit Imaging Article Radiologists today play a central role in making diagnostic decisions and labeling images for training and benchmarking artificial intelligence (AI) algorithms. A key concern is low inter-reader reliability (IRR) seen between experts when interpreting challenging cases. While team-based decisions are known to outperform individual decisions, inter-personal biases often creep up in group interactions which limit nondominant participants from expressing true opinions. To overcome the dual problems of low consensus and interpersonal bias, we explored a solution modeled on bee swarms. Two separate cohorts, three board-certified radiologists, (cohort 1), and five radiology residents (cohort 2) collaborated on a digital swarm platform in real time and in a blinded fashion, grading meniscal lesions on knee MR exams. These consensus votes were benchmarked against clinical (arthroscopy) and radiological (senior-most radiologist) standards of reference using Cohen’s kappa. The IRR of the consensus votes was then compared to the IRR of the majority and most confident votes of the two cohorts. IRR was also calculated for predictions from a meniscal lesion detecting AI algorithm. The attending cohort saw an improvement of 23% in IRR of swarm votes (k = 0.34) over majority vote (k = 0.11). Similar improvement of 23% in IRR (k = 0.25) in 3-resident swarm votes over majority vote (k = 0.02) was observed. The 5-resident swarm had an even higher improvement of 30% in IRR (k = 0.37) over majority vote (k = 0.07). The swarm consensus votes outperformed individual and majority vote decision in both the radiologists and resident cohorts. The attending and resident swarms also outperformed predictions from a state-of-the-art AI algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10278-022-00662-3. Springer International Publishing 2022-11-22 2023-04 /pmc/articles/PMC10039189/ /pubmed/36414832 http://dx.doi.org/10.1007/s10278-022-00662-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Shah, Rutwik
Astuto Arouche Nunes, Bruno
Gleason, Tyler
Fletcher, Will
Banaga, Justin
Sweetwood, Kevin
Ye, Allen
Patel, Rina
McGill, Kevin
Link, Thomas
Crane, Jason
Pedoia, Valentina
Majumdar, Sharmila
Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
title Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
title_full Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
title_fullStr Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
title_full_unstemmed Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
title_short Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
title_sort utilizing a digital swarm intelligence platform to improve consensus among radiologists and exploring its applications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039189/
https://www.ncbi.nlm.nih.gov/pubmed/36414832
http://dx.doi.org/10.1007/s10278-022-00662-3
work_keys_str_mv AT shahrutwik utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT astutoarouchenunesbruno utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT gleasontyler utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT fletcherwill utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT banagajustin utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT sweetwoodkevin utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT yeallen utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT patelrina utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT mcgillkevin utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT linkthomas utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT cranejason utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT pedoiavalentina utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications
AT majumdarsharmila utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications