Cargando…
Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications
Radiologists today play a central role in making diagnostic decisions and labeling images for training and benchmarking artificial intelligence (AI) algorithms. A key concern is low inter-reader reliability (IRR) seen between experts when interpreting challenging cases. While team-based decisions ar...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039189/ https://www.ncbi.nlm.nih.gov/pubmed/36414832 http://dx.doi.org/10.1007/s10278-022-00662-3 |
_version_ | 1784912219224408064 |
---|---|
author | Shah, Rutwik Astuto Arouche Nunes, Bruno Gleason, Tyler Fletcher, Will Banaga, Justin Sweetwood, Kevin Ye, Allen Patel, Rina McGill, Kevin Link, Thomas Crane, Jason Pedoia, Valentina Majumdar, Sharmila |
author_facet | Shah, Rutwik Astuto Arouche Nunes, Bruno Gleason, Tyler Fletcher, Will Banaga, Justin Sweetwood, Kevin Ye, Allen Patel, Rina McGill, Kevin Link, Thomas Crane, Jason Pedoia, Valentina Majumdar, Sharmila |
author_sort | Shah, Rutwik |
collection | PubMed |
description | Radiologists today play a central role in making diagnostic decisions and labeling images for training and benchmarking artificial intelligence (AI) algorithms. A key concern is low inter-reader reliability (IRR) seen between experts when interpreting challenging cases. While team-based decisions are known to outperform individual decisions, inter-personal biases often creep up in group interactions which limit nondominant participants from expressing true opinions. To overcome the dual problems of low consensus and interpersonal bias, we explored a solution modeled on bee swarms. Two separate cohorts, three board-certified radiologists, (cohort 1), and five radiology residents (cohort 2) collaborated on a digital swarm platform in real time and in a blinded fashion, grading meniscal lesions on knee MR exams. These consensus votes were benchmarked against clinical (arthroscopy) and radiological (senior-most radiologist) standards of reference using Cohen’s kappa. The IRR of the consensus votes was then compared to the IRR of the majority and most confident votes of the two cohorts. IRR was also calculated for predictions from a meniscal lesion detecting AI algorithm. The attending cohort saw an improvement of 23% in IRR of swarm votes (k = 0.34) over majority vote (k = 0.11). Similar improvement of 23% in IRR (k = 0.25) in 3-resident swarm votes over majority vote (k = 0.02) was observed. The 5-resident swarm had an even higher improvement of 30% in IRR (k = 0.37) over majority vote (k = 0.07). The swarm consensus votes outperformed individual and majority vote decision in both the radiologists and resident cohorts. The attending and resident swarms also outperformed predictions from a state-of-the-art AI algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10278-022-00662-3. |
format | Online Article Text |
id | pubmed-10039189 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-100391892023-03-26 Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications Shah, Rutwik Astuto Arouche Nunes, Bruno Gleason, Tyler Fletcher, Will Banaga, Justin Sweetwood, Kevin Ye, Allen Patel, Rina McGill, Kevin Link, Thomas Crane, Jason Pedoia, Valentina Majumdar, Sharmila J Digit Imaging Article Radiologists today play a central role in making diagnostic decisions and labeling images for training and benchmarking artificial intelligence (AI) algorithms. A key concern is low inter-reader reliability (IRR) seen between experts when interpreting challenging cases. While team-based decisions are known to outperform individual decisions, inter-personal biases often creep up in group interactions which limit nondominant participants from expressing true opinions. To overcome the dual problems of low consensus and interpersonal bias, we explored a solution modeled on bee swarms. Two separate cohorts, three board-certified radiologists, (cohort 1), and five radiology residents (cohort 2) collaborated on a digital swarm platform in real time and in a blinded fashion, grading meniscal lesions on knee MR exams. These consensus votes were benchmarked against clinical (arthroscopy) and radiological (senior-most radiologist) standards of reference using Cohen’s kappa. The IRR of the consensus votes was then compared to the IRR of the majority and most confident votes of the two cohorts. IRR was also calculated for predictions from a meniscal lesion detecting AI algorithm. The attending cohort saw an improvement of 23% in IRR of swarm votes (k = 0.34) over majority vote (k = 0.11). Similar improvement of 23% in IRR (k = 0.25) in 3-resident swarm votes over majority vote (k = 0.02) was observed. The 5-resident swarm had an even higher improvement of 30% in IRR (k = 0.37) over majority vote (k = 0.07). The swarm consensus votes outperformed individual and majority vote decision in both the radiologists and resident cohorts. The attending and resident swarms also outperformed predictions from a state-of-the-art AI algorithm. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10278-022-00662-3. Springer International Publishing 2022-11-22 2023-04 /pmc/articles/PMC10039189/ /pubmed/36414832 http://dx.doi.org/10.1007/s10278-022-00662-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Shah, Rutwik Astuto Arouche Nunes, Bruno Gleason, Tyler Fletcher, Will Banaga, Justin Sweetwood, Kevin Ye, Allen Patel, Rina McGill, Kevin Link, Thomas Crane, Jason Pedoia, Valentina Majumdar, Sharmila Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications |
title | Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications |
title_full | Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications |
title_fullStr | Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications |
title_full_unstemmed | Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications |
title_short | Utilizing a Digital Swarm Intelligence Platform to Improve Consensus Among Radiologists and Exploring Its Applications |
title_sort | utilizing a digital swarm intelligence platform to improve consensus among radiologists and exploring its applications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10039189/ https://www.ncbi.nlm.nih.gov/pubmed/36414832 http://dx.doi.org/10.1007/s10278-022-00662-3 |
work_keys_str_mv | AT shahrutwik utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT astutoarouchenunesbruno utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT gleasontyler utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT fletcherwill utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT banagajustin utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT sweetwoodkevin utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT yeallen utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT patelrina utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT mcgillkevin utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT linkthomas utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT cranejason utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT pedoiavalentina utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications AT majumdarsharmila utilizingadigitalswarmintelligenceplatformtoimproveconsensusamongradiologistsandexploringitsapplications |