Cargando…

An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?

AIMS: The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizabilit...

Descripción completa

Detalles Bibliográficos
Autores principales:	Oliveira e Carmo, Luisa, van den Merkhof, Anke, Olczak, Jakub, Gordon, Max, Jutte, Paul C., Jaarsma, Ruurd L., IJpma, Frank F. A., Doornberg, Job N., Prijs, Jasper
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The British Editorial Society of Bone & Joint Surgery 2021
Materias:	Meta-Analysis
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8558452/ https://www.ncbi.nlm.nih.gov/pubmed/34669518 http://dx.doi.org/10.1302/2633-1462.210.BJO-2021-0133

_version_	1784592564790231040
author	Oliveira e Carmo, Luisa van den Merkhof, Anke Olczak, Jakub Gordon, Max Jutte, Paul C. Jaarsma, Ruurd L. IJpma, Frank F. A. Doornberg, Job N. Prijs, Jasper
author_facet	Oliveira e Carmo, Luisa van den Merkhof, Anke Olczak, Jakub Gordon, Max Jutte, Paul C. Jaarsma, Ruurd L. IJpma, Frank F. A. Doornberg, Job N. Prijs, Jasper
author_sort	Oliveira e Carmo, Luisa
collection	PubMed
description	AIMS: The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs? METHODS: The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS). RESULTS: Out of 1,349 studies, 36 reported development of a CNN for fracture detection and/or classification. Of these, only four (11%) reported a form of EV. One study used temporal EV, one conducted both temporal and geographical EV, and two used geographical EV. When comparing the CNN’s performance on the IV set versus the EV set, the following were found: AUCs of 0.967 (IV) versus 0.975 (EV), 0.976 (IV) versus 0.985 to 0.992 (EV), 0.93 to 0.96 (IV) versus 0.80 to 0.89 (EV), and F1-scores of 0.856 to 0.863 (IV) versus 0.757 to 0.840 (EV). CONCLUSION: The number of externally validated CNNs in orthopaedic trauma for fracture recognition is still scarce. This greatly limits the potential for transfer of these CNNs from the developing institute to another hospital to achieve similar diagnostic performance. We recommend the use of geographical EV and statements such as the Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT-AI), the Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence (SPIRIT-AI) and the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis–Machine Learning (TRIPOD-ML) to critically appraise performance of CNNs and improve methodological rigor, quality of future models, and facilitate eventual implementation in clinical practice. Cite this article: Bone Jt Open 2021;2(10):879–885.
format	Online Article Text
id	pubmed-8558452
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	The British Editorial Society of Bone & Joint Surgery
record_format	MEDLINE/PubMed
spelling	pubmed-85584522021-11-09 An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application? Oliveira e Carmo, Luisa van den Merkhof, Anke Olczak, Jakub Gordon, Max Jutte, Paul C. Jaarsma, Ruurd L. IJpma, Frank F. A. Doornberg, Job N. Prijs, Jasper Bone Jt Open Meta-Analysis AIMS: The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs? METHODS: The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS). RESULTS: Out of 1,349 studies, 36 reported development of a CNN for fracture detection and/or classification. Of these, only four (11%) reported a form of EV. One study used temporal EV, one conducted both temporal and geographical EV, and two used geographical EV. When comparing the CNN’s performance on the IV set versus the EV set, the following were found: AUCs of 0.967 (IV) versus 0.975 (EV), 0.976 (IV) versus 0.985 to 0.992 (EV), 0.93 to 0.96 (IV) versus 0.80 to 0.89 (EV), and F1-scores of 0.856 to 0.863 (IV) versus 0.757 to 0.840 (EV). CONCLUSION: The number of externally validated CNNs in orthopaedic trauma for fracture recognition is still scarce. This greatly limits the potential for transfer of these CNNs from the developing institute to another hospital to achieve similar diagnostic performance. We recommend the use of geographical EV and statements such as the Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT-AI), the Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence (SPIRIT-AI) and the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis–Machine Learning (TRIPOD-ML) to critically appraise performance of CNNs and improve methodological rigor, quality of future models, and facilitate eventual implementation in clinical practice. Cite this article: Bone Jt Open 2021;2(10):879–885. The British Editorial Society of Bone & Joint Surgery 2021-10-20 /pmc/articles/PMC8558452/ /pubmed/34669518 http://dx.doi.org/10.1302/2633-1462.210.BJO-2021-0133 Text en © 2021 Author(s) et al. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution Non-Commercial No Derivatives (CC BY-NC-ND 4.0) licence, which permits the copying and redistribution of the work only, and provided the original author and source are credited. See https://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle	Meta-Analysis Oliveira e Carmo, Luisa van den Merkhof, Anke Olczak, Jakub Gordon, Max Jutte, Paul C. Jaarsma, Ruurd L. IJpma, Frank F. A. Doornberg, Job N. Prijs, Jasper An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
title	An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
title_full	An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
title_fullStr	An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
title_full_unstemmed	An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
title_short	An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
title_sort	increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?
topic	Meta-Analysis
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8558452/ https://www.ncbi.nlm.nih.gov/pubmed/34669518 http://dx.doi.org/10.1302/2633-1462.210.BJO-2021-0133
work_keys_str_mv	AT oliveiraecarmoluisa anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT vandenmerkhofanke anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT olczakjakub anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT gordonmax anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT juttepaulc anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT jaarsmaruurdl anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT ijpmafrankfa anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT doornbergjobn anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT prijsjasper anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT anincreasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT oliveiraecarmoluisa increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT vandenmerkhofanke increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT olczakjakub increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT gordonmax increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT juttepaulc increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT jaarsmaruurdl increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT ijpmafrankfa increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT doornbergjobn increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT prijsjasper increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication AT increasingnumberofconvolutionalneuralnetworksforfracturerecognitionandclassificationinorthopaedicsaretheseexternallyvalidatedandreadyforclinicalapplication

An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics: are these externally validated and ready for clinical application?

Ejemplares similares