Cargando…

An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography

An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studi...

Descripción completa

Detalles Bibliográficos
Autores principales: Arzamasov, Kirill, Vasilev, Yuriy, Vladzymyrskyy, Anton, Omelyanskaya, Olga, Shulkin, Igor, Kozikhina, Darya, Goncharova, Inna, Gelezhe, Pavel, Kirpichev, Yury, Bobrovskaya, Tatiana, Andreychenko, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298418/
https://www.ncbi.nlm.nih.gov/pubmed/37372802
http://dx.doi.org/10.3390/healthcare11121684
_version_ 1785064110256291840
author Arzamasov, Kirill
Vasilev, Yuriy
Vladzymyrskyy, Anton
Omelyanskaya, Olga
Shulkin, Igor
Kozikhina, Darya
Goncharova, Inna
Gelezhe, Pavel
Kirpichev, Yury
Bobrovskaya, Tatiana
Andreychenko, Anna
author_facet Arzamasov, Kirill
Vasilev, Yuriy
Vladzymyrskyy, Anton
Omelyanskaya, Olga
Shulkin, Igor
Kozikhina, Darya
Goncharova, Inna
Gelezhe, Pavel
Kirpichev, Yury
Bobrovskaya, Tatiana
Andreychenko, Anna
author_sort Arzamasov, Kirill
collection PubMed
description An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studies in the datasets were labelled as containing or not containing target pathological findings based on a consensus of two experienced radiologists, and the results of a laboratory test and follow-up examination, where applicable. A total of 204 radiologists from 11 countries with various experience performed an assessment of the dataset with a 5-point Likert scale via a web platform. Eight commercial radiological AI models analyzed the same dataset. The AI AUROC was 0.87 (95% CI:0.83–0.9) versus 0.96 (95% CI 0.94–0.97) for radiologists. The sensitivity and specificity of AI versus radiologists were 0.71 (95% CI 0.64–0.78) versus 0.91 (95% CI 0.86–0.95) and 0.93 (95% CI 0.89–0.96) versus 0.9 (95% CI 0.85–0.94) for AI. The overall diagnostic accuracy of radiologists was superior to AI for chest X-ray and mammography. However, the accuracy of AI was noninferior to the least experienced radiologists for mammography and fluorography, and to all radiologists for chest X-ray. Therefore, an AI-based first reading could be recommended to reduce the workload burden of radiologists for the most common radiological studies such as chest X-ray and mammography.
format Online
Article
Text
id pubmed-10298418
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102984182023-06-28 An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography Arzamasov, Kirill Vasilev, Yuriy Vladzymyrskyy, Anton Omelyanskaya, Olga Shulkin, Igor Kozikhina, Darya Goncharova, Inna Gelezhe, Pavel Kirpichev, Yury Bobrovskaya, Tatiana Andreychenko, Anna Healthcare (Basel) Article An international reader study was conducted to gauge an average diagnostic accuracy of radiologists interpreting chest X-ray images, including those from fluorography and mammography, and establish requirements for stand-alone radiological artificial intelligence (AI) models. The retrospective studies in the datasets were labelled as containing or not containing target pathological findings based on a consensus of two experienced radiologists, and the results of a laboratory test and follow-up examination, where applicable. A total of 204 radiologists from 11 countries with various experience performed an assessment of the dataset with a 5-point Likert scale via a web platform. Eight commercial radiological AI models analyzed the same dataset. The AI AUROC was 0.87 (95% CI:0.83–0.9) versus 0.96 (95% CI 0.94–0.97) for radiologists. The sensitivity and specificity of AI versus radiologists were 0.71 (95% CI 0.64–0.78) versus 0.91 (95% CI 0.86–0.95) and 0.93 (95% CI 0.89–0.96) versus 0.9 (95% CI 0.85–0.94) for AI. The overall diagnostic accuracy of radiologists was superior to AI for chest X-ray and mammography. However, the accuracy of AI was noninferior to the least experienced radiologists for mammography and fluorography, and to all radiologists for chest X-ray. Therefore, an AI-based first reading could be recommended to reduce the workload burden of radiologists for the most common radiological studies such as chest X-ray and mammography. MDPI 2023-06-08 /pmc/articles/PMC10298418/ /pubmed/37372802 http://dx.doi.org/10.3390/healthcare11121684 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Arzamasov, Kirill
Vasilev, Yuriy
Vladzymyrskyy, Anton
Omelyanskaya, Olga
Shulkin, Igor
Kozikhina, Darya
Goncharova, Inna
Gelezhe, Pavel
Kirpichev, Yury
Bobrovskaya, Tatiana
Andreychenko, Anna
An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
title An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
title_full An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
title_fullStr An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
title_full_unstemmed An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
title_short An International Non-Inferiority Study for the Benchmarking of AI for Routine Radiology Cases: Chest X-ray, Fluorography and Mammography
title_sort international non-inferiority study for the benchmarking of ai for routine radiology cases: chest x-ray, fluorography and mammography
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298418/
https://www.ncbi.nlm.nih.gov/pubmed/37372802
http://dx.doi.org/10.3390/healthcare11121684
work_keys_str_mv AT arzamasovkirill aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT vasilevyuriy aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT vladzymyrskyyanton aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT omelyanskayaolga aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT shulkinigor aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT kozikhinadarya aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT goncharovainna aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT gelezhepavel aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT kirpichevyury aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT bobrovskayatatiana aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT andreychenkoanna aninternationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT arzamasovkirill internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT vasilevyuriy internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT vladzymyrskyyanton internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT omelyanskayaolga internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT shulkinigor internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT kozikhinadarya internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT goncharovainna internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT gelezhepavel internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT kirpichevyury internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT bobrovskayatatiana internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography
AT andreychenkoanna internationalnoninferioritystudyforthebenchmarkingofaiforroutineradiologycaseschestxrayfluorographyandmammography