Cargando…
An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studi...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298418/ https://www.ncbi.nlm.nih.gov/pubmed/37372802 http://dx.doi.org/10.3390/healthcare11121684 |
_version_ | 1785064110256291840 |
---|---|
author | Arzamasov, Kirill Vasilev, Yuriy Vladzymyrskyy, Anton Omelyanskaya, Olga Shulkin, Igor Kozikhina, Darya Goncharova, Inna Gelezhe, Pavel Kirpichev, Yury Bobrovskaya, Tatiana Andreychenko, Anna |
author_facet | Arzamasov, Kirill Vasilev, Yuriy Vladzymyrskyy, Anton Omelyanskaya, Olga Shulkin, Igor Kozikhina, Darya Goncharova, Inna Gelezhe, Pavel Kirpichev, Yury Bobrovskaya, Tatiana Andreychenko, Anna |
author_sort | Arzamasov, Kirill |
collection | PubMed |
description | An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studies in the datasets were labelled as containing or not containing target pathological findings based on a consensus of two experienced radiologists, and the results of a laboratory test and follow-up examination, where applicable. A total of 204 radiologists from 11 countries with various experience performed an assessment of the dataset with a 5-point Likert scale via a web platform. Eight commercial radiological AI models analyzed the same dataset. The AI AUROC was 0.87 (95% CI:0.83–0.9) versus 0.96 (95% CI 0.94–0.97) for radiologists. The sensitivity and specificity of AI versus radiologists were 0.71 (95% CI 0.64–0.78) versus 0.91 (95% CI 0.86–0.95) and 0.93 (95% CI 0.89–0.96) versus 0.9 (95% CI 0.85–0.94) for AI. The overall diagnostic accuracy of radiologists was superior to AI for chest X-ray and mammography. However, the accuracy of AI was noninferior to the least experienced radiologists for mammography and fluorography, and to all radiologists for chest X-ray. Therefore, an AI-based first reading could be recommended to reduce the workload burden of radiologists for the most common radiological studies such as chest X-ray and mammography. |
format | Online Article Text |
id | pubmed-10298418 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102984182023-06-28 An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography Arzamasov, Kirill Vasilev, Yuriy Vladzymyrskyy, Anton Omelyanskaya, Olga Shulkin, Igor Kozikhina, Darya Goncharova, Inna Gelezhe, Pavel Kirpichev, Yury Bobrovskaya, Tatiana Andreychenko, Anna Healthcare (Basel) Article An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studies in the datasets were labelled as containing or not containing target pathological findings based on a consensus of two experienced radiologists, and the results of a laboratory test and follow-up examination, where applicable. A total of 204 radiologists from 11 countries with various experience performed an assessment of the dataset with a 5-point Likert scale via a web platform. Eight commercial radiological AI models analyzed the same dataset. The AI AUROC was 0.87 (95% CI:0.83–0.9) versus 0.96 (95% CI 0.94–0.97) for radiologists. The sensitivity and specificity of AI versus radiologists were 0.71 (95% CI 0.64–0.78) versus 0.91 (95% CI 0.86–0.95) and 0.93 (95% CI 0.89–0.96) versus 0.9 (95% CI 0.85–0.94) for AI. The overall diagnostic accuracy of radiologists was superior to AI for chest X-ray and mammography. However, the accuracy of AI was noninferior to the least experienced radiologists for mammography and fluorography, and to all radiologists for chest X-ray. Therefore, an AI-based first reading could be recommended to reduce the workload burden of radiologists for the most common radiological studies such as chest X-ray and mammography. MDPI 2023-06-08 /pmc/articles/PMC10298418/ /pubmed/37372802 http://dx.doi.org/10.3390/healthcare11121684 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Arzamasov, Kirill Vasilev, Yuriy Vladzymyrskyy, Anton Omelyanskaya, Olga Shulkin, Igor Kozikhina, Darya Goncharova, Inna Gelezhe, Pavel Kirpichev, Yury Bobrovskaya, Tatiana Andreychenko, Anna An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography |
title | An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography |
title_full | An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography |
title_fullStr | An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography |
title_full_unstemmed | An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography |
title_short | An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography |
title_sort | international non-inferiority study for the benchmarking of ai for routine radiology cases: chest x-ray, fluorography and mammography |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298418/ https://www.ncbi.nlm.nih.gov/pubmed/37372802 http://dx.doi.org/10.3390/healthcare11121684 |
work_keys_str_mv | AT arzamasovkirill aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT vasilevyuriy aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT vladzymyrskyyanton aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT omelyanskayaolga aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT shulkinigor aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT kozikhinadarya aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT goncharovainna aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT gelezhepavel aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT kirpichevyury aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT bobrovskayatatiana aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT andreychenkoanna aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT arzamasovkirill internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT vasilevyuriy internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT vladzymyrskyyanton internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT omelyanskayaolga internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT shulkinigor internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT kozikhinadarya internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT goncharovainna internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT gelezhepavel internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT kirpichevyury internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT bobrovskayatatiana internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography AT andreychenkoanna internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography |