Cargando…

Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs

OBJECTIVE: To evaluate the behavior of a publicly available deep convolutional neural network (DCNN) bone age algorithm when presented with inappropriate data inputs in both radiological and non-radiological domains. METHODS: We evaluated a publicly available DCNN-based bone age application. The DCN...

Descripción completa

Detalles Bibliográficos
Autores principales: Yi, Paul H., Arun, Anirudh, Hafezi-Nejad, Nima, Choy, Garry, Sair, Haris I., Hui, Ferdinand K., Fritz, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8339162/
https://www.ncbi.nlm.nih.gov/pubmed/34351456
http://dx.doi.org/10.1007/s00256-021-03880-y
_version_ 1783733539278159872
author Yi, Paul H.
Arun, Anirudh
Hafezi-Nejad, Nima
Choy, Garry
Sair, Haris I.
Hui, Ferdinand K.
Fritz, Jan
author_facet Yi, Paul H.
Arun, Anirudh
Hafezi-Nejad, Nima
Choy, Garry
Sair, Haris I.
Hui, Ferdinand K.
Fritz, Jan
author_sort Yi, Paul H.
collection PubMed
description OBJECTIVE: To evaluate the behavior of a publicly available deep convolutional neural network (DCNN) bone age algorithm when presented with inappropriate data inputs in both radiological and non-radiological domains. METHODS: We evaluated a publicly available DCNN-based bone age application. The DCNN was trained on 12,612 pediatric hand radiographs and won the 2017 RSNA Pediatric Bone Age Challenge (concordance of 0.991 with radiologist ground-truth). We used the application to analyze 50 left-hand radiographs (appropriate data inputs) and seven classes of inappropriate data inputs in radiological (i.e., chest radiographs) and non-radiological (i.e., image of street numbers) domains. For each image, we noted if (1) the application distinguished between appropriate and inappropriate data inputs and (2) inference time per image. Mean inference times were compared using ANOVA. RESULTS: The 16Bit Bone Age application calculated bone age for all pediatric hand radiographs with mean inference time of 1.1 s. The application did not distinguish between pediatric hand radiographs and inappropriate image types, including radiological and non-radiological domains. The application inappropriately calculated bone age for all inappropriate image types, with mean inference time of 1.1 s for all categories (p = 1). CONCLUSION: A publicly available DCNN-based bone age application failed to distinguish between appropriate and inappropriate data inputs and calculated bone age for inappropriate images. The awareness of inappropriate outputs based on inappropriate DCNN input is important if tasks such as bone age determination are automated, emphasizing the need for appropriate oversight at the data input and verification stage to avoid unrecognized erroneous results.
format Online
Article
Text
id pubmed-8339162
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-83391622021-08-06 Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs Yi, Paul H. Arun, Anirudh Hafezi-Nejad, Nima Choy, Garry Sair, Haris I. Hui, Ferdinand K. Fritz, Jan Skeletal Radiol Scientific Article OBJECTIVE: To evaluate the behavior of a publicly available deep convolutional neural network (DCNN) bone age algorithm when presented with inappropriate data inputs in both radiological and non-radiological domains. METHODS: We evaluated a publicly available DCNN-based bone age application. The DCNN was trained on 12,612 pediatric hand radiographs and won the 2017 RSNA Pediatric Bone Age Challenge (concordance of 0.991 with radiologist ground-truth). We used the application to analyze 50 left-hand radiographs (appropriate data inputs) and seven classes of inappropriate data inputs in radiological (i.e., chest radiographs) and non-radiological (i.e., image of street numbers) domains. For each image, we noted if (1) the application distinguished between appropriate and inappropriate data inputs and (2) inference time per image. Mean inference times were compared using ANOVA. RESULTS: The 16Bit Bone Age application calculated bone age for all pediatric hand radiographs with mean inference time of 1.1 s. The application did not distinguish between pediatric hand radiographs and inappropriate image types, including radiological and non-radiological domains. The application inappropriately calculated bone age for all inappropriate image types, with mean inference time of 1.1 s for all categories (p = 1). CONCLUSION: A publicly available DCNN-based bone age application failed to distinguish between appropriate and inappropriate data inputs and calculated bone age for inappropriate images. The awareness of inappropriate outputs based on inappropriate DCNN input is important if tasks such as bone age determination are automated, emphasizing the need for appropriate oversight at the data input and verification stage to avoid unrecognized erroneous results. Springer Berlin Heidelberg 2021-08-05 2022 /pmc/articles/PMC8339162/ /pubmed/34351456 http://dx.doi.org/10.1007/s00256-021-03880-y Text en © ISS 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Scientific Article
Yi, Paul H.
Arun, Anirudh
Hafezi-Nejad, Nima
Choy, Garry
Sair, Haris I.
Hui, Ferdinand K.
Fritz, Jan
Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs
title Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs
title_full Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs
title_fullStr Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs
title_full_unstemmed Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs
title_short Can AI distinguish a bone radiograph from photos of flowers or cars? Evaluation of bone age deep learning model on inappropriate data inputs
title_sort can ai distinguish a bone radiograph from photos of flowers or cars? evaluation of bone age deep learning model on inappropriate data inputs
topic Scientific Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8339162/
https://www.ncbi.nlm.nih.gov/pubmed/34351456
http://dx.doi.org/10.1007/s00256-021-03880-y
work_keys_str_mv AT yipaulh canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs
AT arunanirudh canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs
AT hafezinejadnima canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs
AT choygarry canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs
AT sairharisi canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs
AT huiferdinandk canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs
AT fritzjan canaidistinguishaboneradiographfromphotosofflowersorcarsevaluationofboneagedeeplearningmodeloninappropriatedatainputs