Cargando…

Critical evaluation of deep neural networks for wrist fracture detection

Wrist Fracture is the most common type of fracture with a high incidence rate. Conventional radiography (i.e. X-ray imaging) is used for wrist fracture detection routinely, but occasionally fracture delineation poses issues and an additional confirmation by computed tomography (CT) is needed for dia...

Descripción completa

Detalles Bibliográficos
Autores principales: Raisuddin, Abu Mohammed, Vaattovaara, Elias, Nevalainen, Mika, Nikki, Marko, Järvenpää, Elina, Makkonen, Kaisa, Pinola, Pekka, Palsio, Tuula, Niemensivu, Arttu, Tervonen, Osmo, Tiulpin, Aleksei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7971048/
https://www.ncbi.nlm.nih.gov/pubmed/33727668
http://dx.doi.org/10.1038/s41598-021-85570-2
_version_ 1783666542556217344
author Raisuddin, Abu Mohammed
Vaattovaara, Elias
Nevalainen, Mika
Nikki, Marko
Järvenpää, Elina
Makkonen, Kaisa
Pinola, Pekka
Palsio, Tuula
Niemensivu, Arttu
Tervonen, Osmo
Tiulpin, Aleksei
author_facet Raisuddin, Abu Mohammed
Vaattovaara, Elias
Nevalainen, Mika
Nikki, Marko
Järvenpää, Elina
Makkonen, Kaisa
Pinola, Pekka
Palsio, Tuula
Niemensivu, Arttu
Tervonen, Osmo
Tiulpin, Aleksei
author_sort Raisuddin, Abu Mohammed
collection PubMed
description Wrist Fracture is the most common type of fracture with a high incidence rate. Conventional radiography (i.e. X-ray imaging) is used for wrist fracture detection routinely, but occasionally fracture delineation poses issues and an additional confirmation by computed tomography (CT) is needed for diagnosis. Recent advances in the field of Deep Learning (DL), a subfield of Artificial Intelligence (AI), have shown that wrist fracture detection can be automated using Convolutional Neural Networks. However, previous studies did not pay close attention to the difficult cases which can only be confirmed via CT imaging. In this study, we have developed and analyzed a state-of-the-art DL-based pipeline for wrist (distal radius) fracture detection—DeepWrist, and evaluated it against one general population test set, and one challenging test set comprising only cases requiring confirmation by CT. Our results reveal that a typical state-of-the-art approach, such as DeepWrist, while having a near-perfect performance on the general independent test set, has a substantially lower performance on the challenging test set—average precision of 0.99 (0.99–0.99) versus 0.64 (0.46–0.83), respectively. Similarly, the area under the ROC curve was of 0.99 (0.98–0.99) versus 0.84 (0.72–0.93), respectively. Our findings highlight the importance of a meticulous analysis of DL-based models before clinical use, and unearth the need for more challenging settings for testing medical AI systems.
format Online
Article
Text
id pubmed-7971048
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-79710482021-03-19 Critical evaluation of deep neural networks for wrist fracture detection Raisuddin, Abu Mohammed Vaattovaara, Elias Nevalainen, Mika Nikki, Marko Järvenpää, Elina Makkonen, Kaisa Pinola, Pekka Palsio, Tuula Niemensivu, Arttu Tervonen, Osmo Tiulpin, Aleksei Sci Rep Article Wrist Fracture is the most common type of fracture with a high incidence rate. Conventional radiography (i.e. X-ray imaging) is used for wrist fracture detection routinely, but occasionally fracture delineation poses issues and an additional confirmation by computed tomography (CT) is needed for diagnosis. Recent advances in the field of Deep Learning (DL), a subfield of Artificial Intelligence (AI), have shown that wrist fracture detection can be automated using Convolutional Neural Networks. However, previous studies did not pay close attention to the difficult cases which can only be confirmed via CT imaging. In this study, we have developed and analyzed a state-of-the-art DL-based pipeline for wrist (distal radius) fracture detection—DeepWrist, and evaluated it against one general population test set, and one challenging test set comprising only cases requiring confirmation by CT. Our results reveal that a typical state-of-the-art approach, such as DeepWrist, while having a near-perfect performance on the general independent test set, has a substantially lower performance on the challenging test set—average precision of 0.99 (0.99–0.99) versus 0.64 (0.46–0.83), respectively. Similarly, the area under the ROC curve was of 0.99 (0.98–0.99) versus 0.84 (0.72–0.93), respectively. Our findings highlight the importance of a meticulous analysis of DL-based models before clinical use, and unearth the need for more challenging settings for testing medical AI systems. Nature Publishing Group UK 2021-03-16 /pmc/articles/PMC7971048/ /pubmed/33727668 http://dx.doi.org/10.1038/s41598-021-85570-2 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Raisuddin, Abu Mohammed
Vaattovaara, Elias
Nevalainen, Mika
Nikki, Marko
Järvenpää, Elina
Makkonen, Kaisa
Pinola, Pekka
Palsio, Tuula
Niemensivu, Arttu
Tervonen, Osmo
Tiulpin, Aleksei
Critical evaluation of deep neural networks for wrist fracture detection
title Critical evaluation of deep neural networks for wrist fracture detection
title_full Critical evaluation of deep neural networks for wrist fracture detection
title_fullStr Critical evaluation of deep neural networks for wrist fracture detection
title_full_unstemmed Critical evaluation of deep neural networks for wrist fracture detection
title_short Critical evaluation of deep neural networks for wrist fracture detection
title_sort critical evaluation of deep neural networks for wrist fracture detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7971048/
https://www.ncbi.nlm.nih.gov/pubmed/33727668
http://dx.doi.org/10.1038/s41598-021-85570-2
work_keys_str_mv AT raisuddinabumohammed criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT vaattovaaraelias criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT nevalainenmika criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT nikkimarko criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT jarvenpaaelina criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT makkonenkaisa criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT pinolapekka criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT palsiotuula criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT niemensivuarttu criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT tervonenosmo criticalevaluationofdeepneuralnetworksforwristfracturedetection
AT tiulpinaleksei criticalevaluationofdeepneuralnetworksforwristfracturedetection