Cargando…

Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems

Artificial intelligence (AI) has been successful at solving numerous problems in machine perception. In radiology, AI systems are rapidly evolving and show progress in guiding treatment decisions, diagnosing, localizing disease on medical images, and improving radiologists' efficiency. A critic...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahmood, Usman, Shrestha, Robik, Bates, David D. B., Mannelli, Lorenzo, Corrias, Giuseppe, Erdi, Yusuf Emre, Kanan, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521929/
https://www.ncbi.nlm.nih.gov/pubmed/34713144
http://dx.doi.org/10.3389/fdgth.2021.671015
_version_ 1784584989483991040
author Mahmood, Usman
Shrestha, Robik
Bates, David D. B.
Mannelli, Lorenzo
Corrias, Giuseppe
Erdi, Yusuf Emre
Kanan, Christopher
author_facet Mahmood, Usman
Shrestha, Robik
Bates, David D. B.
Mannelli, Lorenzo
Corrias, Giuseppe
Erdi, Yusuf Emre
Kanan, Christopher
author_sort Mahmood, Usman
collection PubMed
description Artificial intelligence (AI) has been successful at solving numerous problems in machine perception. In radiology, AI systems are rapidly evolving and show progress in guiding treatment decisions, diagnosing, localizing disease on medical images, and improving radiologists' efficiency. A critical component to deploying AI in radiology is to gain confidence in a developed system's efficacy and safety. The current gold standard approach is to conduct an analytical validation of performance on a generalization dataset from one or more institutions, followed by a clinical validation study of the system's efficacy during deployment. Clinical validation studies are time-consuming, and best practices dictate limited re-use of analytical validation data, so it is ideal to know ahead of time if a system is likely to fail analytical or clinical validation. In this paper, we describe a series of sanity tests to identify when a system performs well on development data for the wrong reasons. We illustrate the sanity tests' value by designing a deep learning system to classify pancreatic cancer seen in computed tomography scans.
format Online
Article
Text
id pubmed-8521929
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85219292021-10-27 Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems Mahmood, Usman Shrestha, Robik Bates, David D. B. Mannelli, Lorenzo Corrias, Giuseppe Erdi, Yusuf Emre Kanan, Christopher Front Digit Health Digital Health Artificial intelligence (AI) has been successful at solving numerous problems in machine perception. In radiology, AI systems are rapidly evolving and show progress in guiding treatment decisions, diagnosing, localizing disease on medical images, and improving radiologists' efficiency. A critical component to deploying AI in radiology is to gain confidence in a developed system's efficacy and safety. The current gold standard approach is to conduct an analytical validation of performance on a generalization dataset from one or more institutions, followed by a clinical validation study of the system's efficacy during deployment. Clinical validation studies are time-consuming, and best practices dictate limited re-use of analytical validation data, so it is ideal to know ahead of time if a system is likely to fail analytical or clinical validation. In this paper, we describe a series of sanity tests to identify when a system performs well on development data for the wrong reasons. We illustrate the sanity tests' value by designing a deep learning system to classify pancreatic cancer seen in computed tomography scans. Frontiers Media S.A. 2021-08-03 /pmc/articles/PMC8521929/ /pubmed/34713144 http://dx.doi.org/10.3389/fdgth.2021.671015 Text en Copyright © 2021 Mahmood, Shrestha, Bates, Mannelli, Corrias, Erdi and Kanan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Digital Health
Mahmood, Usman
Shrestha, Robik
Bates, David D. B.
Mannelli, Lorenzo
Corrias, Giuseppe
Erdi, Yusuf Emre
Kanan, Christopher
Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems
title Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems
title_full Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems
title_fullStr Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems
title_full_unstemmed Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems
title_short Detecting Spurious Correlations With Sanity Tests for Artificial Intelligence Guided Radiology Systems
title_sort detecting spurious correlations with sanity tests for artificial intelligence guided radiology systems
topic Digital Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521929/
https://www.ncbi.nlm.nih.gov/pubmed/34713144
http://dx.doi.org/10.3389/fdgth.2021.671015
work_keys_str_mv AT mahmoodusman detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems
AT shrestharobik detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems
AT batesdaviddb detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems
AT mannellilorenzo detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems
AT corriasgiuseppe detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems
AT erdiyusufemre detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems
AT kananchristopher detectingspuriouscorrelationswithsanitytestsforartificialintelligenceguidedradiologysystems